I. V. Savelyev 

PHYSICS 

A General Course 

MECHANICS 

MOLECULAR 

PHYSICS 



Mir Publishers 

Moscow 


И. В. САВЕЛЬЕВ
КУРС ОБЩЕЙ ФИЗИКИ
ТОМ I
МЕХАНИКА

МОЛЕКУЛЯРНАЯ ФИЗИКА


ИЗДАТЕЛЬСТВО «НАУКА»
МОСКВА



I. V. SAVELYEV 


PHYSICS 

A GENERAL COURSE 

(In three volumes) 


VOLUME I 

MECHANICS 

MOLECULAR 

PHYSICS 



MIR PUBLISHERS 
MOSCOW 



Translated from Russian by G. Leib


First published 1980 

Revised from the 1977 Russian edition 

Second printing 1985 

Third printing 1989


На английском языке


Printed in the Union of Soviet Socialist Republics 


ISBN 5-03-000901-9
ISBN 5-03-000900-0 


© Издательство «Наука», 1977
© English translation Mir Publishers, 1980 



AUTHOR’S PREFACE TO 
THE ENGLISH EDITION 


The present book is the first volume of a three-volume general 
course in physics. The course is a result of twenty-five years' work
in the Department of General Physics of the Moscow Institute of
Engineering Physics. I was in constant personal contact with my
students not only at lectures, but also, perhaps even more importantly,
at exercises, consultations, and examinations. These fruitful
contacts helped me refine and improve the exposition of the various 
topics in the course. 

The advice and friendly criticism of my colleagues in the depart- 
ment has also been a great help. I would like to make special mention 
of the part played by N. B. Narozhny, who, in particular, is the 
author of the original and comparatively simple statistical derivation
of the equation $dS = d'Q/T$ [Eq. (11.110)].

In writing the book, I have done everything in my power to 
acquaint students with the basic ideas and methods of physics 
and to teach them how to think physically. This is why the book 
is not encyclopedic in its nature. It is mainly devoted to explaining 
the meaning of physical laws and showing how to apply them con- 
sciously. What I have tried to achieve is a deep knowledge of the 
fundamental principles of physics rather than a shallower acquaintance
with a wide range of questions.

While using the book, try not to memorize the material formalisti- 
cally and mechanically, but logically, i.e. memorize the material 
by thoroughly understanding it. I have tried to present physics 
not as a science for “cramming”, not as a certain volume of information
to be memorized, but as a clever, logical, and attractive
science. It is left to the reader to judge the extent to which I have 
succeeded in doing this. 

Acknowledging the fact that a thick volume by its very appearance 
makes a student despondent, I have done my utmost to limit the 
size of the course. This was achieved by carefully choosing the 
material which in my opinion should be included in a general course 
of physics. I also tried to be concise, but not at the expense of clarity. 

Notwithstanding my desire to reduce the size, I considered it 
essential to include a number of mathematical sections in the course: 
on vectors, linear differential equations, the basic concepts of the 
theory of probability, etc. This was done to impart a “physical” 
tinge to the relevant concepts and relations. In addition, the mathematical
“inclusions” make it possible to go on with the physics
even if, as is often the case, the relevant material has not yet been 
covered in a mathematics course. 

The present course is intended above all for higher technical 
schools with an extended syllabus in physics. The material has been
arranged, however, so that the book can be used as a teaching aid 
for higher technical schools with an ordinary syllabus simply by 
omitting some sections. 

Igor Savelyev 

Moscow, July, 1979


INTRODUCTION


Physics is a science dealing with the most general properties and 
forms of motion of matter. 

A classical definition of matter was given by V. Lenin in his book 
Materialism and Empirio-Criticism: “Matter is a philosophical
category denoting the objective reality which is given to man by 
his sensations, and which is copied, photographed and reflected by 
our sensations, while existing independently of them”*. Two propo- 
sitions are significant in this definition, namely, (1) matter is what 
exists objectively, i.e. independently of anyone’s consciousness or 
sensations, and (2) matter is copied and reflected by our sensations 
and, consequently, is cognizable. 

It follows from the definition of physics that it concentrates 
knowledge accumulated on the most general properties and phenom- 
ena of the world surrounding us. As academician S. Vavilov noted 
in one of his articles, “the extremely common character of a consid- 
erable part of the contents of physics, its facts and laws drew 
physics and philosophy together from time immemorial... Sometimes
physical statements have such a nature that they are difficult to 
distinguish and separate from philosophical statements, and a phys- 
icist must be a philosopher”. 

Two kinds of matter are known at present: substance and field. 
The first kind of matter — substance — includes, for example, atoms, 
molecules, and all bodies built of them. Electromagnetic, gravitational,
and other fields form the second kind of matter. The different
kinds of matter can change into each other. For instance, an
electron and a positron (representatives of substance) may trans- 
form into photons (i.e. into an electromagnetic field). The reverse 
process is also possible. 

Matter is in continuous motion, which in dialectical materialism** is
understood to mean any change in general. Motion is an inalienable
property of matter, which, like matter itself, cannot be
created or destroyed. Matter exists and moves in space and in time,
which are forms of existence of matter.

* V. I. Lenin. Collected Works, Vol. 14, p. 130. Moscow, Foreign Languages
Publishing House (1962).

** Dialectical materialism is the name given to the Marxist-Leninist philosophy.
The fundamental issue of any philosophy as to what is primary, matter
or consciousness, is solved by dialectical materialism in favour of matter when
it states that matter is primary and consciousness is secondary. The method
of this philosophy is dialectics. It considers matter in constant motion and
development whose source is contained in the internal contradictions inherent in
objects and phenomena themselves.

The laws of physics are established by generalizing experimental 
facts. They express the objective regularities existing in nature. 
These laws are customarily expressed in the form of quantitative 
relationships between various physical quantities. 

The fundamental method of investigation in physics is the running 
of an experiment, i.e. the observation of the phenomenon being 
studied in accurately controlled conditions. The latter must permit 
one to watch the course of the phenomenon and reproduce it each 
time when these conditions are repeated. Phenomena can be produced
experimentally that are not observed in nature. For example,
more than ten of the chemical elements known at present have
so far not been discovered in nature and were obtained artificially
by means of nuclear reactions.

Hypotheses are enlisted to explain experimental data. A hypothe- 
sis is a scientific assumption advanced to explain a definite fact 
or phenomenon and requiring verification and proving to become 
a scientific theory or law. The correctness of a hypothesis is verified 
by running the corresponding experiments and by determining 
whether the corollaries following from the hypothesis agree with 
the results of experiments and observations. A hypothesis that has 
successfully passed such verification and has been proved becomes 
a scientific law or theory. 

A physical theory is a system of basic ideas summarizing experi- 
mental data and reflecting the objective regularities of nature. 
A physical theory explains a whole field of natural phenomena 
from a single viewpoint. 

Physics is subdivided into the so-called classical physics and 
quantum physics. The term classical is applied to the physics whose 
creation was completed at the beginning of the 20th century. Clas- 
sical physics was founded by Isaac Newton (1642-1727), who for- 
mulated the fundamental laws of classical mechanics. Newtonian 
mechanics proved to be exceedingly fruitful and powerful, and physicists
acquired the conviction that any physical phenomenon can be
explained with the aid of Newton’s laws. 

The edifice of classical physics built up by the end of the 19th
century was very harmonious. Most physicists were convinced that 
they already knew everything about nature that could be known. 
The most perspicacious physicists, however, understood that the 
edifice of classical physics had weak spots. For example, the British 
physicist William Thomson (Lord Kelvin, 1824-1907) said that there 
are two dark clouds on the horizon of the cloudless sky of classical 
physics — the unsuccessful attempts to set up a theory of blackbody 
radiation, and the contradictory behaviour of ether, the hypothetical
medium in which light waves were supposed to propagate. The
persistent attempts to surmount these difficulties led to unexpected 
results. To solve these problems, which were beyond the possibili- 
ties of classical physics, it became necessary to revise quite radically 
the established, habitual notions and introduce concepts that were 
alien to the spirit of classical physics. Max Planck (1858-1947) 
succeeded in solving the problem of blackbody radiation in 1900 by 
introducing the concept of light emission in separate portions — 
quanta. Thus, at the threshold of the 20th century, the concept of
the quantum appeared. It plays an exceedingly important part in 
modern physics and has resulted in the creation of quantum mechan- 
ics. 

The contradictory nature of the experimental facts relating to 
ether induced Albert Einstein (1879-1955) to revise the notions of 
space and time that were considered to be obvious from Newton’s 
times. The result was the appearance of the theory of relativity. 
The latter gives equations of motion appreciably differing from 
those of Newtonian mechanics for bodies travelling with speeds that 
are noticeable in comparison with the speed of light. 

The year 1897 saw the discovery of the electron. The atoms of all 
the chemical elements were found to contain these particles. Thus, 
atoms, previously considered indivisible, appeared to have a com- 
plicated structure. 

The beginning of the 20th century was thus marked in physics 
by the radical breaking down of numerous habitual concepts and 
notions. New physical discoveries and theories destroyed the notions 
of the structure of matter formed by many physicists. Some of them 
interpreted this as the vanishing of matter. Many physicists lapsed 
into idealism, and a crisis began in physics. 

V. Lenin in his book Materialism and Empirio-Criticism written 
in 1908 gave annihilating criticism of “physical” idealism. He showed 
that the new discoveries indicate not the vanishing of matter, but 
the vanishing of the limit up to which matter was known before 
that time. “‘Matter disappears’”, wrote Lenin, “means that the 
limit within which we have hitherto known matter disappears and 
that our knowledge is penetrating deeper; properties of matter are 
likewise disappearing which formerly seemed absolute, immutable, 
and primary (impenetrability, inertia, mass, etc.) and which are 
now revealed to be relative and characteristic only of certain states 
of matter. For the sole ‘property’ of matter with whose recognition 
philosophical materialism is bound up is the property of being 
an objective reality, of existing outside the mind.”*


* V. I. Lenin. Collected Works, Vol. 14, p. 260. Moscow, Foreign Languages
Publishing House (1962).

The process of cognizing the world is infinite. Our knowledge at
any given stage of development of science is due to the historically 
achieved level of cognition and cannot be considered as final or 
complete. It is of necessity relative knowledge, i.e. requires further 
development, further verification, and more precise definition. 
At the same time, any truly scientific theory, notwithstanding its 
relativity and incompleteness, contains elements of absolute, i.e.
complete, knowledge, and thus signifies a step in the cognition of 
the objective world. For instance, mechanics based on Newton’s 
laws is not correct, strictly speaking. But for a certain range of phe- 
nomena, this mechanics is quite satisfactory. Thus, the develop- 
ment of science did not cross out Newtonian mechanics. It only 
established the limits within which it is correct. Newtonian mechan- 
ics formed a constituent part of the general edifice of the physical 
science. 

The beginning of the 20th century is characterized by persistent 
attempts to penetrate into the internal structure of atoms. The key 
to determining their structure was found to be the studying of 
atomic spectra. The theory of the atom developed by Niels Bohr 
(1885-1962) in 1913 was the first striking success in explaining the 
observed spectra. This theory, however, has obvious features of 
inconsistency: in addition to the motion of an electron in an atom 
obeying the laws of classical mechanics, the theory imposes special 
quantum restrictions on this motion. The theory soon had to pay 
for this lack of consistency. After the first successes in explaining 
the spectra of the simplest atom — that of hydrogen — it was found 
that Bohr’s theory is unable to explain the behaviour of atoms with 
two or more electrons. 

The need to develop a new comprehensive theory of atoms became 
pressing. A bold hypothesis of Louis de Broglie put forward in
1924 placed the cornerstone in such a theory. It was known by that 
time that light, while being a wave process, also exhibits a corpus- 
cular nature in a number of cases, i.e. behaves like a stream of parti- 
cles. De Broglie put forth the idea that the particles of a substance, 
in turn, should display wave properties too in definite conditions. 
De Broglie’s hypothesis soon received a brilliant experimental 
confirmation — it was proved that a wave process is associated with 
the particles of a substance, and it must be taken into account when 
considering the mechanics of an atom. A result of this discovery 
was the development by Erwin Schrödinger (1887-1961) and Werner
Heisenberg (1901-1976) of a new physical theory: wave or
quantum mechanics. The latter achieved striking successes in 
explaining atomic processes and the structure of a substance. Results 
were obtained that showed excellent agreement with experimental 
data when it was found possible to surmount the mathematical 
difficulties.

Recent decades have been marked by remarkable achievements in the
field of studying the atomic nucleus. Scientists and engineers have 
mastered nuclear processes to such an extent that the practical use 
of nuclear energy has become possible. One of the leading places 
in this field belongs to Soviet physics. In particular, the first atomic
power plant in the world was erected in the USSR. 

Finally, in recent years, the walls of laboratories created by the 
hands of man were moved apart beyond the limits of our globe. 
On October 4, 1957, an artificial satellite of the Earth was launched 
in the Soviet Union for the first time in history. It was a small laboratory
outfitted with apparatus for scientific research. April 12, 1961,
saw the first flight of a man into outer space. The first Soviet cosmo- 
naut, Yuri Gagarin, flew around the Earth and landed safely. The 
first space rockets were built in the Soviet Union. They left the 
field of the Earth’s attraction and transmitted to the Earth by 
means of radio signals valuable results of studying outer space and, 
particularly, photographs of the reverse side of the Moon. In 1969, 
U.S. astronauts landed on the Moon. In 1975, two Soviet automatic 
spaceships made a soft landing on Venus and transmitted valuable 
information on the physical conditions on this planet, and also 
photographs of its surface. 

There is no doubt that the near future will be marked by
new fundamental discoveries in the science of physics.



PART I THE PHYSICAL FUNDAMENTALS OF MECHANICS


CHAPTER 1 KINEMATICS 


1.1. Mechanical Motion 

Mechanical motion is the simplest form of motion of matter. 
It consists in the movement of bodies or their parts relative to one 
another. We can see movements of bodies everywhere in our ordi- 
nary life. This is why mechanical notions are so clear. This also 
explains the fact that mechanics was the first of all the natural 
sciences to be developed very broadly. 

A combination of bodies separated for consideration is called 
a mechanical system. The bodies to be included in a system depend 
on the nature of the problem being solved. In a particular case, 
a system may consist of a single body. 

It was indicated above that motion in mechanics is defined as the 
change in the mutual arrangement of bodies. If we imagine a separate 
isolated body in a space where no other bodies are present, then we 
cannot speak of the motion of the body because there is nothing 
with respect to which the body could change its position. It thus 
follows that if we intend to study the motion of a body, then we 
must indicate with respect to what other bodies the given motion 
occurs. 

Motion occurs both in space and in time (space and time are 
inalienable forms of existence of matter). Consequently, to describe
motion, we must also determine time. We use a timepiece (watch 
or clock) for this purpose. 

A combination of bodies that are stationary relative to one another 
with respect to which motion is being considered and a timepiece 
indicating the time forms a reference frame. 

The motion of the same body relative to different reference frames 
may have a different nature. For example, let us imagine a train 
gaining speed. Suppose that a passenger is walking with a constant 
velocity along the corridor of one of the cars of the train. The motion
of the passenger relative to the car will be uniform, and relative
to the Earth’s surface it will be accelerated. 
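The train example can be put into numbers. The following sketch is an illustration only and is not part of the original text. It assumes motion along a straight track, a train that starts from rest with constant acceleration, the classical rule that the velocity relative to the car and the velocity of the car add, and hypothetical values for the walking speed and the acceleration.

```python
# Hypothetical values chosen only for illustration.
WALKING_SPEED = 1.5        # m/s, passenger relative to the car (constant)
TRAIN_ACCELERATION = 0.5   # m/s^2, train relative to the Earth (assumed constant)


def velocity_relative_to_car(t):
    """Passenger's velocity in the reference frame of the car: uniform motion."""
    return WALKING_SPEED


def velocity_relative_to_earth(t):
    """Passenger's velocity in the Earth's frame.

    Assumes the train starts from rest at t = 0 and that velocities
    along the track add classically (non-relativistic speeds).
    """
    train_velocity = TRAIN_ACCELERATION * t
    return train_velocity + WALKING_SPEED


times = [0.0, 2.0, 4.0]  # seconds
in_car = [velocity_relative_to_car(t) for t in times]
on_earth = [velocity_relative_to_earth(t) for t in times]

for t, v_car, v_earth in zip(times, in_car, on_earth):
    print(f"t = {t:3.1f} s: {v_car:.1f} m/s relative to the car, "
          f"{v_earth:.1f} m/s relative to the Earth")
```

The first column of velocities stays the same at every moment, while the second grows with time: the same motion is uniform in one reference frame and accelerated in the other.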

To describe the motion of a body means to indicate for every 
moment of time the position of the body in space and its velocity.
To set the state of a mechanical system, we must indicate the posi- 
tions and the velocities of all the bodies forming the system. A typi- 
cal problem of mechanics consists in determining the states of a sys- 
tem at all the following moments of time t when we know the state 
of the system at a certain initial moment $t_0$ and also the laws governing
the motion.

It must be noted that no physical problem can be solved absolutely 
exactly. An approximate solution is always obtained. The degree 
of approximation is determined by the nature of the problem and 
the object to be achieved. In solving a problem approximately, we 
disregard the factors that are not significant in the given case. For 
example, we may often disregard the dimensions of the body whose 
motion is being studied. For instance, it is quite possible to disregard 
the Earth’s dimensions when treating its motion about the Sun. 
This allows us to considerably simplify our description of the motion 
because the Earth’s position in space can be determined by a single 
point. 
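A rough estimate shows why the Earth's dimensions can be disregarded here. The sketch below is not from the original text. It uses rounded reference values for the Earth's mean radius and the mean Earth-Sun distance and simply compares the two lengths.

```python
# Rounded reference values (not taken from the original text).
EARTH_RADIUS_M = 6.371e6          # mean radius of the Earth, metres
EARTH_SUN_DISTANCE_M = 1.496e11   # mean Earth-Sun distance, metres

# Ratio of the size of the body to the scale of the motion being described.
size_to_orbit_ratio = EARTH_RADIUS_M / EARTH_SUN_DISTANCE_M

print(f"radius / orbital distance = {size_to_orbit_ratio:.1e}")
```

The ratio is of the order of a few hundred-thousandths, so on the scale of the orbit the Earth is well represented by a single point. For a problem such as the rotation of the Earth about its axis the same body could not be treated in this way.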

A body whose dimensions may be disregarded in the conditions 
of a given problem is called a point particle (or simply a particle). 
Whether or not we may consider a given body as a particle depends 
not on the dimensions of the body, but on the conditions of the 
problem. The same body in some cases may be treated as a particle, 
but in others it must be considered as an extended body. 

When speaking about a body as a particle, we abstract
from its dimensions. Another abstraction we have to deal with
in mechanics is a perfectly rigid body. Absolutely undeformable
bodies do not exist in nature. Any body deforms to a greater or
smaller extent, i.e. changes its shape and dimensions, under the
action of forces applied to it. The deformations of bodies when 
considering their movements may often be disregarded, however. 
If this is done, then the body is called perfectly rigid. Thus, a body 
whose deformations may be disregarded in the conditions of a given 
problem is called a perfectly rigid, or simply a rigid body. 

Any motion of a rigid body can be resolved into two basic kinds 
of motion — translational motion and circular motion. 

Translational motion (translation) is defined as motion in which 
any straight line associated with the moving body remains parallel 
to itself (Fig. 1.1). 

In circular motion (rotation), all the points of a body move in 
circles whose centres are on a single straight line called the axis 
of rotation (Fig. 1.2). The axis of rotation can be outside a body
(see Fig. 1.2b). Since when treating a body as a particle we ignore its extent, the
concept of circular motion about an axis passing through such 
a body cannot be applied to it. 

Fig. 1.1    Fig. 1.2 (a), (b)

To acquire the possibility of describing motion quantitatively,
we have to associate a coordinate system (for example a Cartesian
one) with the bodies forming a reference frame. Hence, the position
of a particle can be determined by setting the three numbers x, y,
and z, the Cartesian coordinates of the particle. A coordinate
system can be made by forming a rectangular lattice from identical
rods or rules graduated to a definite scale (Fig. 1.3). Identical
clocks synchronized with one another must be placed at the
lattice points. The position of a particle and the moment of time
corresponding to this position are recorded on the graduated
rods and the clock closest to the particle.

Fig. 1.3

It is simpler to treat a point particle than an extended body.
We shall therefore first study the mechanics of a particle, and
then go over to the mechanics of a rigid body. We shall start
with kinematics, and then delve into dynamics. We remind our
reader that kinematics studies the
motion of bodies without regard to what causes this motion.
Dynamics studies the motion of bodies with a view to what causes
this motion to have the nature it does, i.e. with a view to the
interactions between bodies.


1.2. Vectors

Definition of a Vector. Vectors are defined as quantities characterized
by a numerical value and a direction and, also, as ones that
are added according to the triangle or parallelogram method*. The
last requirement is a very significant one. We can indicate quantities
characterized by a numerical value and a direction
that are nevertheless added in a different way than vectors. We shall take
as an example the rotation of a body about an axis through the
finite angle $\varphi$. Such rotation can be depicted in the form of a segment
of length $\varphi$ directed along the axis about which rotation is occurring
and pointing in a direction associated with that of rotation according
to the right-hand screw rule. The top portion of Fig. 1.4 shows two
consecutive turns of the sphere through the angles $\pi/2$ depicted
by the segments $\varphi_1$ and $\varphi_2$. The first turn about axis 1-1 transfers
point A of the sphere to position A', and the second turn about
axis 2-2 transfers it to position A''. The same result, i.e. transfer
of point A to position A'', can be achieved by turning the sphere
about axis 3-3 (see the bottom portion of Fig. 1.4) through the
angle $\pi$. Hence, such a turn should be considered as the sum of the
turns $\varphi_1$ and $\varphi_2$. It cannot be obtained from the segments $\varphi_1$ and $\varphi_2$,
however, by adding them according to the parallelogram method.
Such addition gives a segment of length $\pi/\sqrt{2}$ instead of the required
length $\pi$. Rotation through the angle $\pi/\sqrt{2}$ transfers point A to
point A'''. It thus follows that the turns through finite angles depicted
by the directed segments do not have the properties of vectors.
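That finite turns do not add like vectors can also be checked numerically. The sketch below is not part of the original text. It assumes two perpendicular axes taken along x and y, builds the rotation matrices with Rodrigues' formula, and uses a hypothetical test point rather than the point A of Fig. 1.4. It shows that the result depends on the order of the two turns, which could not happen for a vector sum, and it evaluates the length that parallelogram addition of two perpendicular segments of length $\pi/2$ would give.

```python
import math


def rotation_matrix(axis, angle):
    """Rodrigues' formula: 3x3 matrix of a rotation by `angle` about the unit vector `axis`."""
    x, y, z = axis
    c, s = math.cos(angle), math.sin(angle)
    k = 1.0 - c
    return [
        [c + x * x * k,     x * y * k - z * s, x * z * k + y * s],
        [y * x * k + z * s, c + y * y * k,     y * z * k - x * s],
        [z * x * k - y * s, z * y * k + x * s, c + z * z * k],
    ]


def apply(matrix, point):
    """Multiply a 3x3 matrix by a 3-component column."""
    return [sum(matrix[i][j] * point[j] for j in range(3)) for i in range(3)]


quarter_turn = math.pi / 2
turn_1 = rotation_matrix((1.0, 0.0, 0.0), quarter_turn)  # axis assumed along x
turn_2 = rotation_matrix((0.0, 1.0, 0.0), quarter_turn)  # axis assumed along y

point = [0.0, 0.0, 1.0]  # hypothetical point on a unit sphere

first_then_second = apply(turn_2, apply(turn_1, point))
second_then_first = apply(turn_1, apply(turn_2, point))

# Length of the diagonal obtained by parallelogram addition of the two segments.
parallelogram_length = math.hypot(quarter_turn, quarter_turn)

print("turn 1, then turn 2:", [round(c, 12) for c in first_then_second])
print("turn 2, then turn 1:", [round(c, 12) for c in second_then_first])
print("parallelogram length:", parallelogram_length, "compared with pi =", math.pi)
```

The two orders carry the test point to different positions, and the diagonal has the length $\pi/\sqrt{2}$, not $\pi$.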


* According to a stricter definition, a vector is a combination of three quantities
that are transformed when the coordinate axes rotate according to a definite law.

The numerical value of a vector is called its magnitude. Figuratively
speaking, the magnitude of a vector indicates its length.
The magnitude of a vector is a scalar, and always a positive one.

Vectors are represented graphically by arrows. The length of an 
arrow determines to the established scale the magnitude of the 
relevant vector, and the arrow points in the direction of the vector. 

Vectors are customarily distinguished by setting their symbols
in boldface type, for example, $\mathbf{a}$, $\mathbf{b}$, $\mathbf{v}$ and $\mathbf{F}$. The same symbols
set in italics signify the magnitude of the relevant vectors, for
example, $a$ is the magnitude of the vector $\mathbf{a}$*. It is sometimes necessary
to express the magnitude by placing a vertical bar (an absolute
value sign) on each side of the symbol for the vector. Thus, $|\mathbf{a}|$ is
the magnitude of the vector $\mathbf{a}$. This representation is used, for
example, to show the magnitude of the sum of the vectors $\mathbf{a}_1$ and $\mathbf{a}_2$:

$|\mathbf{a}_1 + \mathbf{a}_2|$ = magnitude of the vector $(\mathbf{a}_1 + \mathbf{a}_2)$    (1.1)

In this case, the notation $a_1 + a_2$ signifies the sum of the magnitudes
of the vectors being added, which in general does not equal the
magnitude of the sum of the vectors (the two sums will be equal
only when the vectors being added have the same direction).
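The difference between $|\mathbf{a}_1 + \mathbf{a}_2|$ and $a_1 + a_2$ is easy to see on numbers. The sketch below is an illustration added here, not the author's. It assumes that vectors are represented by their Cartesian components (introduced later in this section), and the sample vectors are hypothetical.

```python
import math


def add(u, v):
    """Component-wise sum of two vectors given by Cartesian components."""
    return tuple(ui + vi for ui, vi in zip(u, v))


def magnitude(v):
    """Length of a vector given by Cartesian components."""
    return math.sqrt(sum(c * c for c in v))


a1 = (3.0, 0.0)
a2 = (0.0, 4.0)           # perpendicular to a1
a2_parallel = (6.0, 0.0)  # same direction as a1

magnitude_of_sum = magnitude(add(a1, a2))
sum_of_magnitudes = magnitude(a1) + magnitude(a2)

parallel_magnitude_of_sum = magnitude(add(a1, a2_parallel))
parallel_sum_of_magnitudes = magnitude(a1) + magnitude(a2_parallel)

print(magnitude_of_sum, sum_of_magnitudes)
print(parallel_magnitude_of_sum, parallel_sum_of_magnitudes)
```

For the perpendicular pair the two quantities differ (5 against 7), while for the pair having the same direction they coincide (9 and 9).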

Vectors directed along parallel straight lines (in the same or in 
opposite directions) are called collinear. Vectors in parallel planes 
are called coplanar. Collinear vectors can be arranged along the 
same straight line and coplanar vectors can be brought into one 
plane by parallel translation. 

Collinear vectors equal in magnitude and having the same direc- 
tion are considered to equal each other**. 

Vector Addition and Subtraction. It is more convenient to add 
vectors in practice without constructing a parallelogram. Examina- 
tion of Fig. 1.5 shows that we can achieve the same result if we bring 
the tail of the second vector in contact with the tip of the first one, 
and then draw the resultant vector from the tail of the first vector 
to the tip of the second one. This procedure is especially convenient
when we have to add more than two vectors (Fig. 1.6).

The difference of two vectors $\mathbf{a}$ and $\mathbf{b}$ is defined as the vector $\mathbf{c}$
which when added to the vector $\mathbf{b}$ gives the vector $\mathbf{a}$ (Fig. 1.7; the
vector $-\mathbf{b}$ depicted by a dashed line will be treated below). The magnitude
of the difference of two vectors, like the magnitude of a sum
[see Eq. (1.1)], may be written only with the aid of vertical bars:

$|\mathbf{a}_1 - \mathbf{a}_2|$ = magnitude of the vector $(\mathbf{a}_1 - \mathbf{a}_2)$    (1.2)

because the notation $a_1 - a_2$ signifies the difference of the magnitudes
of the vectors $\mathbf{a}_1$ and $\mathbf{a}_2$, which, generally speaking, does not
equal the magnitude of the vector difference.


* In handwriting, vectors are denoted by arrows over their symbols (for
example, $\vec{a}$). In this case, the same letter without the arrow stands for the
magnitude of the vector.

** What is meant are the so-called free vectors, i.e. vectors that can be drawn
from any point in space. Also distinguished are slip vectors whose tail can be
placed at any point on the straight line along which the vector is directed, and
localized vectors, which are applied to a definite point. The last two kinds of
vectors can be expressed through free vectors. This is why vector calculus is
based on the concept of the free vector, usually called simply a vector.



Multiplication of a Vector by a Scalar. Multiplication of the vector
$\mathbf{a}$ by the scalar $\alpha$ yields a new vector $\mathbf{b} = \alpha\mathbf{a}$ whose magnitude is
$|\alpha|$ times that of the vector $\mathbf{a}$ (i.e. $b = |\alpha| a$). The direction
of the vector $\mathbf{b}$ either coincides with that of the vector $\mathbf{a}$ (if $\alpha > 0$),
or is opposite to it (if $\alpha < 0$). It follows from the above that multiplication
by $-1$ reverses the direction of a vector. Consequently,
the vectors $\mathbf{a}$ and $-\mathbf{a}$ have the same magnitudes, but are opposite
in direction. It is simple to see with the aid of Fig. 1.7 that subtraction
of the vector $\mathbf{b}$ from the vector $\mathbf{a}$ is equivalent to addition
of the vector $-\mathbf{b}$ to the vector $\mathbf{a}$.

Fig. 1.7    Fig. 1.8
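These rules can be tried out on numbers. The sketch below is an added illustration under the assumption that vectors are given by Cartesian components. The helper names `scale`, `add` and `magnitude` and the sample vectors are hypothetical.

```python
import math


def scale(alpha, v):
    """Multiply the vector v by the scalar alpha."""
    return tuple(alpha * c for c in v)


def add(u, v):
    """Component-wise sum of two vectors."""
    return tuple(ui + vi for ui, vi in zip(u, v))


def magnitude(v):
    """Length of a vector given by Cartesian components."""
    return math.sqrt(sum(c * c for c in v))


a = (3.0, 4.0)
alpha = -2.0
b = scale(alpha, a)  # opposite in direction to a because alpha < 0

# The magnitude of b is |alpha| times the magnitude of a.
b_magnitude = magnitude(b)
expected_magnitude = abs(alpha) * magnitude(a)

# Subtraction as addition of the vector multiplied by -1.
p = (5.0, 1.0)
q = (2.0, 3.0)
difference = add(p, scale(-1.0, q))

# By definition the difference, added to q, must give p again.
restored = add(difference, q)

print(b, b_magnitude, expected_magnitude)
print(difference, restored)
```

The last line checks the definition of the difference directly: the vector found by adding $-\mathbf{q}$ to $\mathbf{p}$ gives $\mathbf{p}$ back when $\mathbf{q}$ is added to it.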

It follows from our definition of multiplication of a vector by
a scalar that any vector $\mathbf{a}$ can be represented in the form

$\mathbf{a} = a\,\mathbf{e}_a$    (1.3)

where $a$ = magnitude of the vector $\mathbf{a}$

$\mathbf{e}_a$ = vector with a magnitude of unity and of the same direction
as $\mathbf{a}$ (Fig. 1.8).

The vector $\mathbf{e}_a$ is called the unit vector of the vector $\mathbf{a}$. The unit
vector can be represented in the form

$\mathbf{e}_a = \mathbf{a}/a$    (1.4)

whence it follows that it is a dimensionless quantity.
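A short numerical check of the relation between a vector, its magnitude and its unit vector may be helpful. The sketch below is an added illustration. It assumes Cartesian components, and the function names and the sample vector are hypothetical.

```python
import math


def magnitude(v):
    """Length of a vector given by Cartesian components."""
    return math.sqrt(sum(c * c for c in v))


def unit_vector(v):
    """Unit vector of v, i.e. v divided by its magnitude."""
    length = magnitude(v)
    if length == 0.0:
        raise ValueError("the zero vector has no direction")
    return tuple(c / length for c in v)


a = (3.0, 4.0, 12.0)
a_magnitude = magnitude(a)
e_a = unit_vector(a)

# Multiplying the unit vector by the magnitude restores the original vector.
reconstructed = tuple(a_magnitude * c for c in e_a)

print(a_magnitude, e_a, magnitude(e_a), reconstructed)
```

If the components of $\mathbf{a}$ carry a unit (metres, say), the division by $a$ cancels it, which is the sense in which the unit vector is dimensionless.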

Unit vectors can be associated not only with vectors, but also
with any direction in space. For example, $\mathbf{e}_x$ is the unit vector of
the coordinate axis $x$, $\mathbf{e}_n$ is the unit vector of a normal to a curve
or surface, and $\mathbf{e}_\tau$ is the unit vector of a tangent to a curve.

Linear Relation Between Vectors. Let us consider three non-collinear
vectors $\mathbf{a}$, $\mathbf{b}$ and $\mathbf{c}$ that are in one plane. A glance at Fig. 1.9
shows that any of them (for instance, $\mathbf{c}$) can be expressed through
the other two with the aid of the relation

$\mathbf{c} = \alpha\mathbf{a} + \beta\mathbf{b}$    (1.5)

where $\alpha$ and $\beta$ are scalars (for the case shown in the figure, $\alpha > 1$
and $-1 < \beta < 0$). Hence, we conclude that any vector $\mathbf{c}$ that is
in the same plane as the non-collinear vectors $\mathbf{a}$ and $\mathbf{b}$ can be expressed
through the latter with the aid of linear relation (1.5). When
the vectors $\mathbf{a}$ and $\mathbf{b}$ are fixed, any third vector is unambiguously
determined by the two quantities $\alpha$ and $\beta$.
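The coefficients of relation (1.5) can be found numerically. The sketch below is one possible way of doing it, not the author's. It assumes that the vectors are given by Cartesian components in their common plane and solves the two resulting linear equations by Cramer's rule. The function names and the sample vectors are hypothetical.

```python
import math


def cross2(u, v):
    """Determinant built from two plane vectors: u_x * v_y - u_y * v_x."""
    return u[0] * v[1] - u[1] * v[0]


def decompose(c, a, b):
    """Return (alpha, beta) such that c = alpha * a + beta * b.

    The vectors a and b must be non-collinear, otherwise the
    determinant vanishes and the decomposition is not unique.
    """
    det = cross2(a, b)
    if det == 0.0:
        raise ValueError("a and b are collinear")
    alpha = cross2(c, b) / det
    beta = cross2(a, c) / det
    return alpha, beta


a = (1.0, 0.0)
b = (1.0, 2.0)
c = (1.0, -1.0)

alpha, beta = decompose(c, a, b)

# Rebuild c from the coefficients as a check.
rebuilt = (alpha * a[0] + beta * b[0], alpha * a[1] + beta * b[1])

print(alpha, beta, rebuilt)
```

For these sample vectors the coefficients come out as $\alpha = 1.5$ and $\beta = -0.5$, i.e. in the same ranges as those quoted for the figure, and they are the only pair that reproduces $\mathbf{c}$.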

Assume that we have three vectors $\mathbf{a}$, $\mathbf{b}$ and $\mathbf{c}$, each of which is
not coplanar with the other two.* By analogy with Eq. (1.5), we
can see quite easily that any vector $\mathbf{d}$ can be represented as a linear
combination of the given vectors:

$\mathbf{d} = \alpha\mathbf{a} + \beta\mathbf{b} + \gamma\mathbf{c}$    (1.6)

When the vectors $\mathbf{a}$, $\mathbf{b}$ and $\mathbf{c}$ are fixed, any vector $\mathbf{d}$ is unambiguously
determined by the three quantities $\alpha$, $\beta$ and $\gamma$, each of which may
be either positive or negative.


* Two vectors are always coplanar. This follows from the fact that their
tails can be made to coincide by translation, and they will thus be in one plane.

Projection of a Vector. Let us consider a direction in space that
we shall set by the axis $l$ (Fig. 1.10). Let the vector $\mathbf{a}$ make the
angle $\varphi$ with the axis $l$*. The quantity

$a_l = a \cos\varphi$    (1.7)

(where $a$ is the magnitude of the vector) is called the projection
of the vector $\mathbf{a}$ onto the axis $l$. A projection is designated by the
same symbol as its vector, with the addition of a subscript showing
the direction onto which the vector has been projected.

Fig. 1.11    Fig. 1.12




A projection of a vector is an algebraic quantity. If the vector
makes an acute angle with the given direction, then $\cos\varphi > 0$,
and the projection is positive. If the angle $\varphi$ is obtuse, then
$\cos\varphi < 0$, and, consequently, the projection is negative. When a vector
is at right angles to a given axis, its projection equals zero.
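The sign rule follows directly from the definition of the projection as the magnitude multiplied by the cosine of the angle with the axis. The sketch below is an added illustration with a hypothetical magnitude and three hypothetical angles.

```python
import math


def projection(a_magnitude, phi):
    """Projection of a vector of magnitude a_magnitude making the angle phi (radians) with the axis."""
    return a_magnitude * math.cos(phi)


a_magnitude = 2.0

acute = projection(a_magnitude, math.radians(60.0))    # acute angle
right = projection(a_magnitude, math.radians(90.0))    # right angle
obtuse = projection(a_magnitude, math.radians(120.0))  # obtuse angle

print(acute, right, obtuse)
```

In floating-point arithmetic the projection for the right angle comes out as a number of negligible size rather than as an exact zero, because $\pi/2$ cannot be represented exactly.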

The projection of a vector has a simple geometrical meaning. 
It equals the distance between the projections of the tail and the 
tip of the segment depicting the given vector onto the given axis. 
When q> <1 rc/2, this distance is assumed to be positive, and when 
9 > jx/2, it is negative. 

Let $\mathbf{a} = \mathbf{a}_1 + \mathbf{a}_2 + \mathbf{a}_3 + \mathbf{a}_4$ (Fig. 1.11).
It is easy to see from the figure that the projection of the resultant
vector a onto a direction l equals the sum of the projections of the
separate vectors being added:

$$a_l = a_{1l} + a_{2l} + a_{3l} + a_{4l} \tag{1.8}$$

We must remind our reader that when adding the projections of the
vectors shown in Fig. 1.11, the distances 0-1, 1-2, and 2-3 have to
be taken with the plus sign, and the distance 3-4 with the minus
sign. Equation (1.8) holds for any number of addends.


* If the straight line along which the vector a is directed and the axis l
do not intersect, the angle $\varphi$ should be found by drawing a straight line
parallel to the vector a and intersecting the axis l. The angle between this
line and the axis l will be the angle $\varphi$ we are interested in.

Expressing a Vector Through Its Projections onto the Coordinate
Axes. Let us take Cartesian coordinate axes and consider the vector
a in a plane at right angles to the z-axis (Fig. 1.12). We shall
introduce the unit vectors of the coordinate axes, i.e. the unit vectors
$\mathbf{e}_x$, $\mathbf{e}_y$, and $\mathbf{e}_z$ ($\mathbf{e}_z$ is not shown in the
drawing; it is perpendicular to the plane of the drawing and directed
toward us). It must be noted that these three unit vectors completely
determine a system of coordinates and are therefore called the basis
of the coordinate system.

Inspection of Fig. 1.12 shows that the vector a can be represented
in the form of a linear combination of the unit vectors $\mathbf{e}_x$ and
$\mathbf{e}_y$ [see Eq. (1.5)]:

$$\mathbf{a} = a_x\mathbf{e}_x + a_y\mathbf{e}_y$$

The projections of the vector onto the coordinate axes play the
part of the coefficients $\alpha$ and $\beta$. In the example being considered,
the projection $a_x$ is negative, therefore the vector $a_x\mathbf{e}_x$ has
a direction opposite to that of the unit vector $\mathbf{e}_x$.

We took the vector a perpendicular to the z-axis, owing to which
$a_z = 0$. In the general case when all three projections of a vector
differ from zero, we have

$$\mathbf{a} = a_x\mathbf{e}_x + a_y\mathbf{e}_y + a_z\mathbf{e}_z \tag{1.9}$$

Thus, any vector can be expressed through its projections onto the 
coordinate axes and the unit vectors of these axes. Therefore, 
the projections of a vector onto the coordinate axes are called its 
components. 

The components $a_x$, $a_y$, $a_z$ equal (apart from sign) the lengths
of the edges of a rectangular parallelepiped in which the vector a is
the principal diagonal (Fig. 1.13). We therefore have

$$a^2 = a_x^2 + a_y^2 + a_z^2 \tag{1.10}$$

Assume that $\mathbf{c} = \mathbf{a} + \mathbf{b}$. Representing each of these
vectors in accordance with Eq. (1.9), we get

$$c_x\mathbf{e}_x + c_y\mathbf{e}_y + c_z\mathbf{e}_z = (a_x + b_x)\,\mathbf{e}_x + (a_y + b_y)\,\mathbf{e}_y + (a_z + b_z)\,\mathbf{e}_z$$

(we have factored out $\mathbf{e}_x$, $\mathbf{e}_y$, and $\mathbf{e}_z$). Equal vectors
have identical projections onto the coordinate axes. On these grounds,
we can write that

$$c_x = a_x + b_x, \quad c_y = a_y + b_y, \quad c_z = a_z + b_z \tag{1.11}$$

[compare with Eq. (1.8)]. Equations (1.11) express analytically the
rule of vector addition. They hold for any number of addends.
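
As an illustration of Eqs. (1.10) and (1.11), the following minimal Python sketch stores a vector as a tuple of its three components. The helper names `add` and `magnitude` and the sample numbers are assumptions made for this example only; they do not come from the text.

```python
import math

def add(a, b):
    """Component-wise sum of two vectors, Eq. (1.11)."""
    return tuple(ai + bi for ai, bi in zip(a, b))

def magnitude(a):
    """Magnitude of a vector found from its components, Eq. (1.10)."""
    return math.sqrt(sum(ai * ai for ai in a))

# Sample vectors (illustrative values); a_x is negative, as in Fig. 1.12.
a = (-3.0, 4.0, 0.0)
b = (1.0, 2.0, 2.0)
c = add(a, b)

print("c =", c)
print("|a| =", magnitude(a), " |b| =", magnitude(b), " |c| =", magnitude(c))
```

The magnitude of the sum generally differs from the sum of the magnitudes, which is why the addition rule is stated for components and not for magnitudes.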

Position Vector. The position vector (or radius vector) r of a
point is defined as the vector drawn from the origin of coordinates
to the given point (Fig. 1.14). Its projections onto the coordinate
axes equal the Cartesian coordinates of the given point:

$$r_x = x, \quad r_y = y, \quad r_z = z \tag{1.12}$$

Consequently, in accordance with Eq. (1.9), the position vector
can be represented in the form

$$\mathbf{r} = x\mathbf{e}_x + y\mathbf{e}_y + z\mathbf{e}_z \tag{1.13}$$

By Eq. (1.10), we have

$$r^2 = x^2 + y^2 + z^2 \tag{1.14}$$

The Scalar Product of Vectors. Two vectors a and b can be multiplied
by each other in two ways. One of them results in a scalar quantity,
and the other in a certain new vector. Accordingly, two products of
vectors are distinguished: the scalar product and the vector product.
It must be noted that the operation of dividing a vector by a vector
does not exist.

The scalar product of the vectors a and b is defined as the scalar
quantity equal to the product of the magnitudes of these vectors
and the cosine of the angle $\alpha$ between them:

$$\mathbf{a}\mathbf{b} = ab\cos\alpha \tag{1.15}$$

(Fig. 1.15). When writing a scalar product, the symbols of the
vectors being multiplied are usually written next to each other
without any sign between them (sometimes a dot is used between
the symbols, for example, $\mathbf{a}\cdot\mathbf{b}$; this is why a scalar
product is also called a dot product). Equation (1.15) expresses an
algebraic quantity: when $\alpha$ is acute, we have $\mathbf{a}\mathbf{b} > 0$, and
when it is obtuse, we have $\mathbf{a}\mathbf{b} < 0$. The scalar product of
mutually perpendicular vectors ($\alpha = \pi/2$) equals zero.

It must be noted that by the square of a vector is always meant
the scalar product of this vector by itself:

$$\mathbf{a}^2 = \mathbf{a}\mathbf{a} = aa\cos 0 = a^2 \tag{1.16}$$

Thus, the square of a vector equals the square of its magnitude.
In particular, the square of any unit vector equals unity:

$$\mathbf{e}_x^2 = \mathbf{e}_y^2 = \mathbf{e}_z^2 = 1 \tag{1.17}$$

We shall note in passing that owing to the unit vectors being mutually
perpendicular, scalar products such as $\mathbf{e}_i\mathbf{e}_k$ equal zero
if $i \neq k$.

The Kronecker symbol $\delta_{ik}$ is very convenient. It is defined as
follows:

$$\delta_{ik} = \begin{cases} 1 & \text{when } i = k \\ 0 & \text{when } i \neq k \end{cases} \tag{1.18}$$

When this symbol is used, the properties of the scalar products of
the coordinate axis unit vectors established above can be expressed
by a single formula:

$$\mathbf{e}_i\mathbf{e}_k = \delta_{ik} \tag{1.19}$$

where the subscripts i and k can assume any of the values x, y,
and z independently of each other.

It follows from the definition (1.15) that a scalar product is com- 
mutative, i.e. it does not depend on the sequence of the multipliers: 

ab = ba (1.20) 

Equation (1.15) can be written in several ways:

$$\mathbf{a}\mathbf{b} = ab\cos\alpha = (a\cos\alpha)\,b = a\,(b\cos\alpha)$$

Examination of Fig. 1.15 shows that $a\cos\alpha$ equals $a_b$, the projection
of the vector a onto the direction of the vector b. Similarly,
$b\cos\alpha = b_a$, the projection of the vector b onto the direction
of the vector a. We can therefore say that the scalar product of two
vectors is defined as the scalar quantity equal to the product of the
magnitude of one of the vectors being multiplied and the projection
of the second vector onto the direction of the first one:

$$\mathbf{a}\mathbf{b} = a_b\,b = a\,b_a \tag{1.21}$$

Taking into account that the projection of the sum of vectors
equals the sum of the projections of the vectors being added, we can
write that

$$\mathbf{a}(\mathbf{b} + \mathbf{c} + \ldots) = a\,(\mathbf{b} + \mathbf{c} + \ldots)_{\text{pr. } \mathbf{a}} = a\,(b_a + c_a + \ldots) = ab_a + ac_a + \ldots = \mathbf{a}\mathbf{b} + \mathbf{a}\mathbf{c} + \ldots \tag{1.22}$$

Hence, it follows that the scalar product of vectors is distributive:
the product of the vector a and the sum of several vectors equals
the sum of the products of the vector a and each of the added vectors
taken separately.

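Let us represent the vectors being multiplied in the form of
Eq. (1.9) and take advantage of the distributive nature of a scalar
product. We get

$$\begin{aligned}
\mathbf{a}\mathbf{b} &= (a_x\mathbf{e}_x + a_y\mathbf{e}_y + a_z\mathbf{e}_z)(b_x\mathbf{e}_x + b_y\mathbf{e}_y + b_z\mathbf{e}_z) \\
&= a_xb_x\,\mathbf{e}_x\mathbf{e}_x + a_xb_y\,\mathbf{e}_x\mathbf{e}_y + a_xb_z\,\mathbf{e}_x\mathbf{e}_z + a_yb_x\,\mathbf{e}_y\mathbf{e}_x \\
&\quad + a_yb_y\,\mathbf{e}_y\mathbf{e}_y + a_yb_z\,\mathbf{e}_y\mathbf{e}_z + a_zb_x\,\mathbf{e}_z\mathbf{e}_x + a_zb_y\,\mathbf{e}_z\mathbf{e}_y + a_zb_z\,\mathbf{e}_z\mathbf{e}_z
\end{aligned}$$

Now let us take Eq. (1.19) into consideration. As a result, we get
an expression for a scalar product through the projections of the
vectors being multiplied:

$$\mathbf{a}\mathbf{b} = a_xb_x + a_yb_y + a_zb_z \tag{1.23}$$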

It must be noted that when the coordinate axes are rotated, the
projections of vectors onto these axes change. The quantity
$ab\cos\alpha$ does not depend on the choice of the axes, however. We thus
conclude that the changes in the projections of the vectors a and b,
when the axes are rotated, are of a nature such that their combination
of the form of Eq. (1.23) remains invariant (unchanged):

$$a_xb_x + a_yb_y + a_zb_z = \text{inv.} \tag{1.24}$$

It is a simple matter to see that the projection of the vector a
onto the direction l [see Eq. (1.7)] can be represented in the form

$$a_l = \mathbf{a}\mathbf{e}_l \tag{1.25}$$

where $\mathbf{e}_l$ is the unit vector of the direction l. Similarly,

$$a_x = \mathbf{a}\mathbf{e}_x, \quad a_y = \mathbf{a}\mathbf{e}_y, \quad a_z = \mathbf{a}\mathbf{e}_z \tag{1.26}$$
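
The invariance expressed by Eq. (1.24) and the projection formula (1.25) can be checked numerically. The sketch below is only an illustration: the function names, the sample vectors, and the choice of a rotation of the axes about the z-axis through an angle `theta` are assumptions made for this example.

```python
import math

def dot(a, b):
    """Scalar product through the projections, Eq. (1.23)."""
    return sum(ai * bi for ai, bi in zip(a, b))

def rotate_axes_z(a, theta):
    """Components of the same vector in axes turned by theta about the z-axis."""
    c, s = math.cos(theta), math.sin(theta)
    x, y, z = a
    return (c * x + s * y, -s * x + c * y, z)

a = (1.0, 2.0, 3.0)
b = (-2.0, 0.5, 4.0)
theta = 0.7  # illustrative rotation angle, in radians

before = dot(a, b)
after = dot(rotate_axes_z(a, theta), rotate_axes_z(b, theta))
print(before, after)  # the two values should agree to rounding error

# Projection of a onto the direction of a unit vector e_l, Eq. (1.25).
e_l = (0.0, 0.6, 0.8)
a_l = dot(a, e_l)
print(a_l)
```

The individual components change under the rotation, while the combination of Eq. (1.23) stays the same.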

The Vector Product. The vector product of the vectors a and b
is defined as the vector c determined by the equation

$$\mathbf{c} = (ab\sin\alpha)\,\mathbf{n} \tag{1.27}$$

where a and b = magnitudes of the vectors being multiplied
      $\alpha$ = angle between the vectors
      n = unit vector of a normal* to the plane containing
          the vectors a and b (Fig. 1.16).

* The symbol n is simpler and more illustrative than $\mathbf{e}_n$.

The direction of n is chosen so that the sequence of the vectors a,
b, n forms a right-handed system. This signifies that if we look along
the vector n, then the shortest path in rotation from the first
multiplier to the second one will be clockwise. In Fig. 1.16, the vector
n is directed beyond the drawing, and it is therefore depicted by
a circle with a cross*. The direction of the vector c coincides with
that of n.

Fig. 1.16

A vector product is usually designated in one of two ways:

$$[\mathbf{a}\mathbf{b}] \quad \text{or} \quad \mathbf{a} \times \mathbf{b}$$

the latter notation resulting in the term cross product sometimes
being used to signify a vector product. We shall use the former
notation, and to avoid confusion we shall sometimes put a comma
between the multipliers. Thus, according to Eq. (1.27), we have

$$[\mathbf{a}\mathbf{b}] = (ab\sin\alpha)\,\mathbf{n} \tag{1.28}$$

A glance at Fig. 1.16 shows that the magnitude of a vector product
has a simple geometrical meaning: the expression $ab\sin\alpha$ numerically
equals the area of a parallelogram constructed on the vectors
being multiplied.

We determined the direction of the vector [ab] by relating it 
to the direction of rotation from the first multiplier to the second 
one. When considering vectors such as the position vector r, the 
velocity v, and the force F, the choice of their direction is quite 
obvious — it follows from the nature of these quantities themselves. 
Such vectors are called polar or true. Vectors of the type [a, b] 
whose direction is related to that of rotation are called axial or 
pseudovectors. When conditions change, for example, upon going 
over from a right-hand system of coordinates to a left-hand one, 
the directions of pseudovectors are reversed, while those of true 
vectors remain unchanged. 

It must be borne in mind that a vector product will be a pseudovector
only when both of the vectors being multiplied are true (or
both are pseudovectors). The vector product of a true vector and
a pseudovector will be true. Reversing a condition determining
the direction of a pseudovector will lead in this case to a change
in the sign in front of the vector product and also to a change in the
sign of one of the multipliers. As a result, the quantity expressed
by the vector product remains unchanged.

* We shall depict vectors perpendicular to the plane of a drawing by a circle
with a cross in it if the vector is directed away from us, and by a circle with
a point at its centre if the vector is directed toward us. For clarity, we can
imagine a vector in the form of an arrow with a tapered tip and cross-shaped
feathers on its tail. Thus, when the vector is directed toward us (the arrow is
flying toward us), we see a circle with a point; when the vector is directed away
from us (the arrow is flying away from us), we see a circle with a cross.

Since the direction of a vector product is determined by the
direction of rotation from the first multiplier to the second one,
the result of vector multiplication depends on the order of the
multipliers. Transposition of the multipliers leads to reversing of
the direction of the resultant vector. Thus, a vector product does
not have the property of commutativity:

$$[\mathbf{b}\mathbf{a}] = -[\mathbf{a}\mathbf{b}] \tag{1.29}$$

A vector product can be proved to be distributive, i.e. it can be
shown that

$$[\mathbf{a}, (\mathbf{b}_1 + \mathbf{b}_2 + \ldots)] = [\mathbf{a}\mathbf{b}_1] + [\mathbf{a}\mathbf{b}_2] + \ldots \tag{1.30}$$

Fig. 1.17

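Let us consider the vector products of the unit vectors of the
coordinate axes (Fig. 1.17). In accordance with the definition (1.28),
we have

$$\begin{aligned}
[\mathbf{e}_x\mathbf{e}_x] &= [\mathbf{e}_y\mathbf{e}_y] = [\mathbf{e}_z\mathbf{e}_z] = 0 \\
[\mathbf{e}_x\mathbf{e}_y] &= -[\mathbf{e}_y\mathbf{e}_x] = \mathbf{e}_z \\
[\mathbf{e}_y\mathbf{e}_z] &= -[\mathbf{e}_z\mathbf{e}_y] = \mathbf{e}_x \\
[\mathbf{e}_z\mathbf{e}_x] &= -[\mathbf{e}_x\mathbf{e}_z] = \mathbf{e}_y
\end{aligned} \tag{1.31}$$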

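Representing the vectors being multiplied in the form of Eq. (1.9)
and taking advantage of the distributivity of a vector product, we
get:

$$\begin{aligned}
[\mathbf{a}\mathbf{b}] &= [(a_x\mathbf{e}_x + a_y\mathbf{e}_y + a_z\mathbf{e}_z), (b_x\mathbf{e}_x + b_y\mathbf{e}_y + b_z\mathbf{e}_z)] \\
&= a_xb_x[\mathbf{e}_x\mathbf{e}_x] + a_xb_y[\mathbf{e}_x\mathbf{e}_y] + a_xb_z[\mathbf{e}_x\mathbf{e}_z] \\
&\quad + a_yb_x[\mathbf{e}_y\mathbf{e}_x] + a_yb_y[\mathbf{e}_y\mathbf{e}_y] + a_yb_z[\mathbf{e}_y\mathbf{e}_z] \\
&\quad + a_zb_x[\mathbf{e}_z\mathbf{e}_x] + a_zb_y[\mathbf{e}_z\mathbf{e}_y] + a_zb_z[\mathbf{e}_z\mathbf{e}_z]
\end{aligned}$$

Taking into account relation (1.31), we arrive at the following
expression:

$$[\mathbf{a}\mathbf{b}] = \mathbf{e}_x(a_yb_z - a_zb_y) + \mathbf{e}_y(a_zb_x - a_xb_z) + \mathbf{e}_z(a_xb_y - a_yb_x) \tag{1.32}$$

The above expression can be represented in the form of a determinant:

$$[\mathbf{a}\mathbf{b}] = \begin{vmatrix} \mathbf{e}_x & \mathbf{e}_y & \mathbf{e}_z \\ a_x & a_y & a_z \\ b_x & b_y & b_z \end{vmatrix} \tag{1.33}$$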
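
The component formula (1.32) translates directly into code. The following Python sketch is an illustration only; the function name `cross` and the sample vectors are assumptions of this example. It checks relations (1.29) and (1.31) for the unit vectors of the coordinate axes.

```python
def cross(a, b):
    """Vector product [ab] through the components, Eq. (1.32)."""
    ax, ay, az = a
    bx, by, bz = b
    return (ay * bz - az * by,
            az * bx - ax * bz,
            ax * by - ay * bx)

# Unit vectors of the coordinate axes.
e_x, e_y, e_z = (1, 0, 0), (0, 1, 0), (0, 0, 1)

print(cross(e_x, e_y))  # equals e_z, Eq. (1.31)
print(cross(e_y, e_x))  # opposite sign, Eq. (1.29)
print(cross(e_x, e_x))  # a vector multiplied by itself gives zero

# Illustrative vectors with integer components.
print(cross((1, 2, 3), (4, 5, 6)))
```

Interchanging the arguments reverses the sign of every component, in agreement with Eq. (1.29).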

Scalar Triple Product. A scalar triple product of three vectors
is defined as the expression a[bc], i.e. the scalar product of the
vector a and the vector product of the vectors b and c. According
to the definitions (1.15) and (1.28), we have

$$\mathbf{a}[\mathbf{b}\mathbf{c}] = a\,\{bc\sin(\mathbf{b}, \mathbf{c})\}\cos(\mathbf{a}, \mathbf{n})$$

Here (b, c) is the angle between b and c, and (a, n) is the angle
between the vector a and the unit vector n determining the direction
of the vector [bc]. Inspection of Fig. 1.18 shows that the expression
$bc\sin(\mathbf{b}, \mathbf{c})$ numerically equals the area of the base of a
parallelepiped constructed on the vectors being multiplied, while the
expression $a\cos(\mathbf{a}, \mathbf{n})$ numerically equals the altitude of this
parallelepiped taken with the plus sign if the angle (a, n) is acute,
and with the minus sign if it is obtuse.

Consequently, the expression a[bc] has a simple geometrical meaning:
it numerically equals the volume of a parallelepiped constructed on
the vectors being multiplied [taken with the plus or minus sign
depending on the value of the angle (a, n)]. In calculating the volume
of a parallelepiped, the result cannot depend on which of its faces is
taken as the base. Hence, it follows that

$$\mathbf{a}[\mathbf{b}\mathbf{c}] = \mathbf{b}[\mathbf{c}\mathbf{a}] = \mathbf{c}[\mathbf{a}\mathbf{b}] \tag{1.34}$$

Thus, a scalar triple product permits cyclic transposition of the
multipliers, i.e. substitution for each of the multipliers of the one
following it in the cycle:

$$\mathbf{a} \to \mathbf{b} \to \mathbf{c} \to \mathbf{a}$$

Fig. 1.18

Vector Triple Product. Let us consider a vector triple product
of the three vectors a, b, and c:

$$\mathbf{d} = [\mathbf{a}, [\mathbf{b}\mathbf{c}]]$$

Any vector product is perpendicular to both multipliers. Therefore,
the vector d is perpendicular to the unit vector n determining the
direction of the vector [bc]. Hence, it follows that the vector d
is in the plane formed by the vectors b and c and, consequently,
can be represented as a linear combination of these vectors:

$$\mathbf{d} = \alpha\mathbf{b} + \beta\mathbf{c}$$

[see Eq. (1.5)]. We find from the relevant calculations that
$\alpha = \mathbf{a}\mathbf{c}$ and $\beta = -\mathbf{a}\mathbf{b}$. Thus,

$$[\mathbf{a}, [\mathbf{b}\mathbf{c}]] = \mathbf{b}(\mathbf{a}\mathbf{c}) - \mathbf{c}(\mathbf{a}\mathbf{b}) \tag{1.35}$$
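
Relations (1.34) and (1.35) are easy to verify for particular vectors. The sketch below is a numerical check and not a proof; the helper names and the integer sample vectors are assumptions chosen so that the arithmetic is exact.

```python
def dot(a, b):
    """Scalar product through the components, Eq. (1.23)."""
    return sum(p * q for p, q in zip(a, b))

def cross(a, b):
    """Vector product through the components, Eq. (1.32)."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def scale(k, a):
    """Product of the scalar k and the vector a."""
    return tuple(k * p for p in a)

def sub(a, b):
    """Difference of two vectors."""
    return tuple(p - q for p, q in zip(a, b))

a, b, c = (1, 2, 3), (4, 5, 6), (7, 8, 10)

# Cyclic transposition in the scalar triple product, Eq. (1.34).
print(dot(a, cross(b, c)), dot(b, cross(c, a)), dot(c, cross(a, b)))

# Vector triple product, Eq. (1.35): left-hand and right-hand sides.
lhs = cross(a, cross(b, c))
rhs = sub(scale(dot(a, c), b), scale(dot(a, b), c))
print(lhs, rhs)
```

The common value of the three scalar triple products is the signed volume of the parallelepiped constructed on a, b, and c.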

Derivative of a Vector. Let us consider a vector that changes in 
time according to a known law a (t). The projections of this vector 
onto the coordinate axes are preset functions of time. Hence, 

$$\mathbf{a}(t) = \mathbf{e}_x a_x(t) + \mathbf{e}_y a_y(t) + \mathbf{e}_z a_z(t) \tag{1.36}$$


(we assume that the coordinate axes do not rotate in space so that 
their unit vectors do not change with time). 

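Let the vector projections receive the increments $\Delta a_x$, $\Delta a_y$,
$\Delta a_z$ during the time $\Delta t$. The vector therefore receives the
increment $\Delta\mathbf{a} = \mathbf{e}_x\Delta a_x + \mathbf{e}_y\Delta a_y + \mathbf{e}_z\Delta a_z$.
The rate of change of the vector a with time can be characterized by
the ratio of $\Delta\mathbf{a}$ to $\Delta t$:

$$\frac{\Delta\mathbf{a}}{\Delta t} = \mathbf{e}_x\frac{\Delta a_x}{\Delta t} + \mathbf{e}_y\frac{\Delta a_y}{\Delta t} + \mathbf{e}_z\frac{\Delta a_z}{\Delta t} \tag{1.37}$$

This expression gives the mean rate of change of a during the time
interval $\Delta t$. Let us assume that a changes continuously with time,
without any jumps. Consequently, the smaller the interval $\Delta t$,
the more accurately does the value of Eq. (1.37) characterize the
rate of change in a at the moment t preceding the interval $\Delta t$.
Therefore, the rate of change in the vector a at the moment t equals
the limit of Eq. (1.37) obtained when $\Delta t$ tends to zero:

$$\text{the rate of change in } \mathbf{a} = \lim_{\Delta t \to 0}\frac{\Delta\mathbf{a}}{\Delta t} = \mathbf{e}_x\lim_{\Delta t \to 0}\frac{\Delta a_x}{\Delta t} + \mathbf{e}_y\lim_{\Delta t \to 0}\frac{\Delta a_y}{\Delta t} + \mathbf{e}_z\lim_{\Delta t \to 0}\frac{\Delta a_z}{\Delta t} \tag{1.38}$$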

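If there is a function f(t) of the argument t, then the limit of the
ratio of the increment of the function $\Delta f$ to the increment of the
argument $\Delta t$ obtained when $\Delta t$ tends to zero is called the
derivative of the function f with respect to t and is designated by the
symbol df/dt. Expression (1.38) can therefore be written as follows:

$$\frac{d\mathbf{a}}{dt} = \mathbf{e}_x\frac{da_x}{dt} + \mathbf{e}_y\frac{da_y}{dt} + \mathbf{e}_z\frac{da_z}{dt} \tag{1.39}$$

The result obtained signifies that the projections of the vector
$d\mathbf{a}/dt$ onto the coordinate axes equal the time derivatives of the
projections of the vector a:

$$\left(\frac{d\mathbf{a}}{dt}\right)_x = \frac{da_x}{dt}, \quad \left(\frac{d\mathbf{a}}{dt}\right)_y = \frac{da_y}{dt}, \quad \left(\frac{d\mathbf{a}}{dt}\right)_z = \frac{da_z}{dt} \tag{1.40}$$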

It is customary practice in physics to denote time derivatives by
the symbol of the corresponding quantity with a dot over it, for
example,

$$\frac{d\varphi}{dt} = \dot{\varphi}, \quad \frac{d^2\varphi}{dt^2} = \ddot{\varphi}, \quad \frac{da}{dt} = \dot{a}, \quad \frac{d^2a}{dt^2} = \ddot{a} \tag{1.41}$$

Using this notation, we can write formula (1.39) as follows:

$$\dot{\mathbf{a}} = \mathbf{e}_x\dot{a}_x + \mathbf{e}_y\dot{a}_y + \mathbf{e}_z\dot{a}_z \tag{1.42}$$

If we take the position vector r(t) of a moving point as a(t), then
by Eq. (1.42) we have

$$\dot{\mathbf{r}} = \mathbf{e}_x\dot{x} + \mathbf{e}_y\dot{y} + \mathbf{e}_z\dot{z} \tag{1.43}$$

where x, y, z are functions of t, namely, x = x(t), y = y(t),
z = z(t).

The differential ("increment") of the function f(t) is defined
as the expression

$$df = f'\,dt \tag{1.44}$$

where $f'$ is the derivative of f with respect to t. According to
Eq. (1.39), the differential of the vector a is determined by the
formula

$$d\mathbf{a} = \mathbf{e}_x\,da_x + \mathbf{e}_y\,da_y + \mathbf{e}_z\,da_z \tag{1.45}$$

In particular,

$$d\mathbf{r} = \mathbf{e}_x\,dx + \mathbf{e}_y\,dy + \mathbf{e}_z\,dz \tag{1.46}$$

It must be noted that the increment of a function during a very
short, but finite interval $\Delta t$ approximately equals

$$\Delta f \approx f'\,\Delta t = \frac{df}{dt}\,\Delta t \tag{1.47}$$

In the limit, when $\Delta t \to 0$, the approximate equation (1.47)
transforms into the accurate equation (1.44).

A formula similar to (1.47) can also be written for the vector
function:

$$\Delta\mathbf{a} \approx \frac{d\mathbf{a}}{dt}\,\Delta t \tag{1.48}$$
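
The passage from the ratio (1.37) to the derivative (1.39) can be observed numerically. In the sketch below the vector function `a(t)` and its exact derivative are hypothetical examples chosen for illustration; the ratio of increments is formed component by component and compared with the exact derivative for decreasing values of the interval.

```python
import math

def a(t):
    """A hypothetical vector function a(t), given by its three projections."""
    return (math.cos(t), math.sin(t), t * t)

def a_dot_exact(t):
    """Exact time derivatives of the projections of a(t), Eq. (1.39)."""
    return (-math.sin(t), math.cos(t), 2.0 * t)

def difference_quotient(f, t, dt):
    """The ratio of increments (Delta a)/(Delta t), Eq. (1.37)."""
    f0, f1 = f(t), f(t + dt)
    return tuple((c1 - c0) / dt for c0, c1 in zip(f0, f1))

t = 1.0
for dt in (1e-1, 1e-3, 1e-5):
    approx = difference_quotient(a, t, dt)
    err = max(abs(p - q) for p, q in zip(approx, a_dot_exact(t)))
    print(dt, err)  # the error shrinks as dt decreases
```

For this example the error falls roughly in proportion to the interval, which is the behaviour expected of the approximate relation (1.48).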


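Derivative of the Product of Functions. We shall consider the
function b(t) that equals the product of the scalar function $\varphi(t)$
and the vector function a(t), i.e. $\mathbf{b}(t) = \varphi(t)\,\mathbf{a}(t)$ or, more
briefly, $\mathbf{b} = \varphi\mathbf{a}$. Let us find the increment of the function b:

$$\Delta\mathbf{b} = \Delta(\varphi\mathbf{a}) = (\varphi + \Delta\varphi)(\mathbf{a} + \Delta\mathbf{a}) - \varphi\mathbf{a} = \varphi\,\Delta\mathbf{a} + \mathbf{a}\,\Delta\varphi + \Delta\varphi\,\Delta\mathbf{a}$$

Representing the increments of the functions in the form of
expressions (1.47) and (1.48), we get:

$$\Delta\mathbf{b} \approx \varphi\frac{d\mathbf{a}}{dt}\Delta t + \mathbf{a}\frac{d\varphi}{dt}\Delta t + \frac{d\varphi}{dt}\frac{d\mathbf{a}}{dt}(\Delta t)^2$$

whence

$$\frac{\Delta\mathbf{b}}{\Delta t} \approx \varphi\frac{d\mathbf{a}}{dt} + \mathbf{a}\frac{d\varphi}{dt} + \frac{d\varphi}{dt}\frac{d\mathbf{a}}{dt}\Delta t$$

In the limit when $\Delta t$ tends to zero, this approximate equation
transforms into an accurate one. Thus,

$$\frac{d\mathbf{b}}{dt} = \lim_{\Delta t \to 0}\frac{\Delta\mathbf{b}}{\Delta t} = \lim_{\Delta t \to 0}\left\{\varphi\frac{d\mathbf{a}}{dt} + \mathbf{a}\frac{d\varphi}{dt} + \frac{d\varphi}{dt}\frac{d\mathbf{a}}{dt}\Delta t\right\}$$

The first two addends do not depend on $\Delta t$ and therefore do not
change when going over to the limit. The limit of the third addend
equals zero. Hence, substituting $\varphi\mathbf{a}$ for b, we obtain:

$$\frac{d}{dt}(\varphi\mathbf{a}) = \varphi\frac{d\mathbf{a}}{dt} + \mathbf{a}\frac{d\varphi}{dt} = \varphi\dot{\mathbf{a}} + \dot{\varphi}\mathbf{a} \tag{1.49}$$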

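Now let us consider the scalar product of two vector functions
a(t) and b(t). The increment of this product is

$$\Delta(\mathbf{a}\mathbf{b}) = (\mathbf{a} + \Delta\mathbf{a})(\mathbf{b} + \Delta\mathbf{b}) - \mathbf{a}\mathbf{b} = \mathbf{a}\,\Delta\mathbf{b} + \mathbf{b}\,\Delta\mathbf{a} + \Delta\mathbf{a}\,\Delta\mathbf{b} \approx \mathbf{a}\dot{\mathbf{b}}\,\Delta t + \mathbf{b}\dot{\mathbf{a}}\,\Delta t + \dot{\mathbf{a}}\dot{\mathbf{b}}\,(\Delta t)^2$$

Hence

$$\frac{d}{dt}(\mathbf{a}\mathbf{b}) = \lim_{\Delta t \to 0}\frac{\Delta(\mathbf{a}\mathbf{b})}{\Delta t} = \lim_{\Delta t \to 0}\left(\mathbf{a}\dot{\mathbf{b}} + \mathbf{b}\dot{\mathbf{a}} + \dot{\mathbf{a}}\dot{\mathbf{b}}\,\Delta t\right)$$

or finally

$$\frac{d}{dt}(\mathbf{a}\mathbf{b}) = \dot{\mathbf{a}}\mathbf{b} + \mathbf{a}\dot{\mathbf{b}} \tag{1.50}$$

Multiplying Eq. (1.50) by dt, we get a differential:

$$d(\mathbf{a}\mathbf{b}) = \mathbf{a}\,d\mathbf{b} + \mathbf{b}\,d\mathbf{a} \tag{1.51}$$

Let us calculate the derivative and the differential of the square
of a vector function. According to Eqs. (1.50) and (1.51), we have

$$\frac{d}{dt}\mathbf{a}^2 = 2\mathbf{a}\dot{\mathbf{a}} \tag{1.52}$$

$$d(\mathbf{a}^2) = 2\mathbf{a}\,d\mathbf{a} \tag{1.53}$$

Taking into account that $\mathbf{a}^2 = a^2$ [see Eq. (1.16)], we can write:

$$2\mathbf{a}\,d\mathbf{a} = d(a^2) \quad \text{or} \quad \mathbf{a}\,d\mathbf{a} = d\left(\frac{a^2}{2}\right) \tag{1.54}$$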
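
Finally, let us consider the derivative of the vector product of the
functions a(t) and b(t). The increment of the function being
considered is

$$\begin{aligned}
\Delta[\mathbf{a}\mathbf{b}] &= [(\mathbf{a} + \Delta\mathbf{a}), (\mathbf{b} + \Delta\mathbf{b})] - [\mathbf{a}\mathbf{b}] = [\mathbf{a}, \Delta\mathbf{b}] + [\Delta\mathbf{a}, \mathbf{b}] + [\Delta\mathbf{a}, \Delta\mathbf{b}] \\
&\approx [\mathbf{a}, \dot{\mathbf{b}}\,\Delta t] + [\dot{\mathbf{a}}\,\Delta t, \mathbf{b}] + [\dot{\mathbf{a}}\,\Delta t, \dot{\mathbf{b}}\,\Delta t]
\end{aligned}$$

Correspondingly,

$$\frac{d}{dt}[\mathbf{a}\mathbf{b}] = \lim_{\Delta t \to 0}\left\{[\mathbf{a}\dot{\mathbf{b}}] + [\dot{\mathbf{a}}\mathbf{b}] + [\dot{\mathbf{a}}\dot{\mathbf{b}}]\,\Delta t\right\}$$

After a limit transition, we arrive at the formula

$$\frac{d}{dt}[\mathbf{a}\mathbf{b}] = [\dot{\mathbf{a}}\mathbf{b}] + [\mathbf{a}\dot{\mathbf{b}}] \tag{1.55}$$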
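
Formulas (1.50) and (1.55) can be checked numerically for particular functions. The sketch below is an illustration under stated assumptions: the vector functions `a(t)` and `b(t)`, their exact derivatives, and the central-difference helper `central` are all chosen for this example and do not come from the text.

```python
import math

def dot(a, b):
    """Scalar product through the components, Eq. (1.23)."""
    return sum(p * q for p, q in zip(a, b))

def cross(a, b):
    """Vector product through the components, Eq. (1.32)."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def add(a, b):
    """Component-wise sum of two vectors."""
    return tuple(p + q for p, q in zip(a, b))

# Hypothetical vector functions and their exact time derivatives.
def a(t):     return (t, t * t, 1.0)
def a_dot(t): return (1.0, 2.0 * t, 0.0)
def b(t):     return (math.sin(t), math.cos(t), t)
def b_dot(t): return (math.cos(t), -math.sin(t), 1.0)

def central(f, t, h=1e-6):
    """Central-difference estimate of df/dt for a scalar- or tuple-valued f."""
    lo, hi = f(t - h), f(t + h)
    if isinstance(lo, tuple):
        return tuple((q - p) / (2 * h) for p, q in zip(lo, hi))
    return (hi - lo) / (2 * h)

t = 0.8
lhs_dot = central(lambda s: dot(a(s), b(s)), t)
rhs_dot = dot(a_dot(t), b(t)) + dot(a(t), b_dot(t))             # Eq. (1.50)
lhs_cross = central(lambda s: cross(a(s), b(s)), t)
rhs_cross = add(cross(a_dot(t), b(t)), cross(a(t), b_dot(t)))   # Eq. (1.55)
print(lhs_dot, rhs_dot)
print(lhs_cross, rhs_cross)
```

In the second check the order of the multipliers inside each vector product is kept, since a vector product is not commutative.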


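Derivative of a Unit Vector. Let us consider the unit vector $\mathbf{e}_a$
of the vector a. It is obvious that the vector $\mathbf{e}_a$ can change only
in direction. Assume that during the very short interval $\Delta t$ the
vector a and together with it the unit vector $\mathbf{e}_a$ rotate through the
angle $\Delta\varphi$ (Fig. 1.19). At a low value of $\Delta\varphi$, the magnitude of
the vector $\Delta\mathbf{e}_a$ approximately equals the angle $\Delta\varphi$, namely,
$|\Delta\mathbf{e}_a| \approx \Delta\varphi$ (the segment depicting $\Delta\mathbf{e}_a$ is the base of
an isosceles triangle with sides equal to unity).

We must note that the smaller $\Delta\varphi$ is, the more accurate is our
approximate equation. The vector $\Delta\mathbf{e}_a$ itself can be represented
in the form

$$\Delta\mathbf{e}_a = |\Delta\mathbf{e}_a|\,\mathbf{e}_{\Delta e} \approx \Delta\varphi \cdot \mathbf{e}_{\Delta e}$$

where $\mathbf{e}_{\Delta e}$ is the unit vector of the vector $\Delta\mathbf{e}_a$. When
$\Delta\varphi$ tends to zero, the unit vector $\mathbf{e}_{\Delta e}$ will rotate and in the
limit coincide with the unit vector $\mathbf{e}_\perp$ perpendicular to $\mathbf{e}_a$
(see Fig. 1.19).

Fig. 1.19

The derivative of $\mathbf{e}_a$ with respect to t, by definition, is

$$\frac{d\mathbf{e}_a}{dt} = \lim_{\Delta t \to 0}\frac{\Delta\mathbf{e}_a}{\Delta t} = \lim_{\Delta t \to 0}\frac{\Delta\varphi}{\Delta t}\,\mathbf{e}_{\Delta e} = \frac{d\varphi}{dt}\,\mathbf{e}_\perp$$

Thus,

$$\dot{\mathbf{e}}_a = \dot{\varphi}\,\mathbf{e}_\perp \tag{1.56}$$

The quantity $\dot{\varphi} = d\varphi/dt$ is the angular velocity of rotation of the
vector a (see Sec. 1.5). The unit vector $\mathbf{e}_\perp$ is in the plane in which
the vector a is rotating at the given moment, and its sense is in the
direction of rotation.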
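
Equation (1.56) can be illustrated for a unit vector rotating in the plane x, y. In the sketch below the law of rotation `phi(t)` is a hypothetical example; the numerically estimated derivative of the unit vector is compared with the product of the angular velocity and the perpendicular unit vector.

```python
import math

def phi(t):
    """A hypothetical law of rotation: the angle as a function of time."""
    return t * t

def phi_dot(t):
    """Angular velocity corresponding to phi(t)."""
    return 2.0 * t

def e_a(t):
    """Unit vector making the angle phi(t) with the x-axis."""
    return (math.cos(phi(t)), math.sin(phi(t)))

def e_perp(t):
    """Unit vector perpendicular to e_a, pointing toward growing phi."""
    return (-math.sin(phi(t)), math.cos(phi(t)))

def e_a_dot_numeric(t, h=1e-6):
    """Central-difference estimate of the time derivative of e_a."""
    lo, hi = e_a(t - h), e_a(t + h)
    return tuple((q - p) / (2 * h) for p, q in zip(lo, hi))

t = 0.9
numeric = e_a_dot_numeric(t)
predicted = tuple(phi_dot(t) * c for c in e_perp(t))   # Eq. (1.56)
print(numeric, predicted)
```

The derivative has no component along the unit vector itself, as must be the case for a vector whose magnitude stays constant.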




The Physical Fundamentals of Mechanics 


1.3. Velocity and Speed 

A point particle in motion travels along a certain line. The latter 
is called its path or trajectory*. Depending on the shape of a tra- 
jectory, we distinguish rectilinear or straight motion, circular 
motion, curvilinear motion, etc. 

Assume that a point particle (in the following we shall call it
simply a particle for brevity's sake) travelled along a certain
trajectory from point 1 to point 2 (Fig. 1.20). The path between
points 1 and 2 measured along the trajectory is called the distance
travelled by the particle. We shall denote it by the symbol s.

The straight-line segment drawn from point 1 to point 2 is called the
displacement of the particle. We shall denote it by the symbol
$\mathbf{r}_{12}$. Let us assume that a particle completes two successive
displacements $\mathbf{r}_{12}$ and $\mathbf{r}_{23}$ (Fig. 1.21). It is natural to
call the sum of the first two the displacement $\mathbf{r}_{13}$ that leads to
the same result as they do together. Thus, displacements are
characterized by magnitude and direction and, besides, are added by
using the parallelogram method. Hence, it follows that displacement
is a vector.


* It must be noted that the concept of a trajectory can be applied only to
a "classical" particle to which accurate values of its coordinate and momentum
(i.e. velocity) can be ascribed at each moment of time. According to quantum
mechanics, real particles can be characterized with the aid of a coordinate and
momentum only with a certain accuracy. The limit of this accuracy is determined
by the equation of Heisenberg's uncertainty principle:

$$\Delta x \cdot \Delta p \gtrsim \hbar$$

Here $\Delta x$ is the uncertainty in the coordinate of a particle, $\Delta p$ is the
uncertainty in its momentum, and $\hbar$ is Planck's constant h divided by $2\pi$,
i.e. $\hbar = h/2\pi = 1.05 \times 10^{-34}$ J·s. The sign $\gtrsim$ signifies "greater
than a value of the order of".

Replacing the momentum with the product of the mass and the velocity,
we can write

$$\Delta x \cdot \Delta v \gtrsim \frac{\hbar}{m}$$

It can be seen from this relation that the smaller the mass of a particle, the
more uncertain do its coordinate and velocity become, and, consequently, the less
applicable is the concept of trajectory. For macroscopic bodies (i.e. bodies
formed by a very great number of molecules), the uncertainties in the coordinate
and velocity do not exceed the practically attainable accuracy of measuring
these quantities. Hence, the concept of trajectory may be applied to such bodies
without any reservations. For microparticles (electrons, protons, neutrons,
separate atoms and molecules), the concept of trajectory either cannot be
applied at all, or can be applied with a limited accuracy, depending on the
conditions in which motion occurs. For example, the motion of electrons in a
cathode-ray tube can approximately be considered as occurring along certain
trajectories.

In everyday life, we use the terms speed and velocity interchangeably,
but in physics there is an important distinction between
them. Speed depends on the distance travelled, and velocity on the
displacement. Speed is the distance travelled by a particle in unit
time. If a particle travels identical distances during equal time
intervals that may be as small as desired, its motion is called
uniform. In this case, the speed of the particle at each moment can
be calculated by dividing the distance s by the time t.

Velocity is a vector quantity characterizing not only how fast
a particle travels along its trajectory, but also the direction in which
the particle moves at each moment. Let us divide a trajectory into
infinitely small portions of length ds. An infinitely small
displacement $d\mathbf{r}$ corresponds to each of these portions (Fig. 1.22).
Dividing this displacement by the corresponding time interval dt, we get
the instantaneous velocity at the given point of the trajectory:

$$\mathbf{v} = \frac{d\mathbf{r}}{dt} \tag{1.57}$$

Fig. 1.22    Fig. 1.23

Thus, the velocity is the derivative of the position vector of the
particle with respect to time. The displacement $d\mathbf{r}$ coincides with
an infinitely small element of the trajectory. Consequently, the
vector v is directed along a tangent to the trajectory (see Fig. 1.22).

Reasoning more strictly, to derive formula (1.57) we must proceed
as follows. Having fixed a certain moment of time t, let us consider
the increment of the position vector $\Delta\mathbf{r}$ during the small time
interval $\Delta t$* following t (Fig. 1.23). The ratio $\Delta\mathbf{r}/\Delta t$ gives
the average value of the velocity during the time $\Delta t$. If we take
smaller and smaller intervals $\Delta t$, the ratio $\Delta\mathbf{r}/\Delta t$ in the limit
will give us the value of the velocity v at the moment t:

$$\mathbf{v} = \lim_{\Delta t \to 0}\frac{\Delta\mathbf{r}}{\Delta t} = \frac{d\mathbf{r}}{dt} \tag{1.58}$$

We have arrived at formula (1.57).

Let us find the magnitude of the expression (1.58), i.e. the
magnitude of the velocity v:

$$v = |\mathbf{v}| = \left|\lim_{\Delta t \to 0}\frac{\Delta\mathbf{r}}{\Delta t}\right| = \lim_{\Delta t \to 0}\frac{|\Delta\mathbf{r}|}{\Delta t} \tag{1.59}$$

We cannot write $\Delta r$ instead of $|\Delta\mathbf{r}|$ in this formula. The vector
$\Delta\mathbf{r}$ is in essence the difference between two vectors (r at the
moment $t + \Delta t$ minus r at the moment t). Therefore, its magnitude may
be written only with the aid of vertical bars [see Eq. (1.2)]. The
symbol $|\Delta\mathbf{r}|$ signifies the magnitude of the increment of the vector
r, whereas $\Delta r$ is the increment of the magnitude of the vector r, i.e.
$\Delta|\mathbf{r}|$. These two quantities, generally speaking, do not equal each
other:

$$|\Delta\mathbf{r}| \neq \Delta|\mathbf{r}| = \Delta r$$

The following example will illustrate this. Assume that the vector r
receives such an increment $\Delta\mathbf{r}$ that its magnitude does not change,
i.e. $|\mathbf{r} + \Delta\mathbf{r}| = |\mathbf{r}|$ (Fig. 1.24). Consequently, the increment
of the magnitude of the vector equals zero ($\Delta|\mathbf{r}| = \Delta r = 0$). At
the same time, the magnitude of the increment of the vector r,
i.e. $|\Delta\mathbf{r}|$, differs from zero (it equals the length of segment 2-3).
What has been said above holds for any vector a: in the general case
$|\Delta\mathbf{a}| \neq \Delta a$.

Inspection of Fig. 1.23 shows that the distance $\Delta s$, generally
speaking, differs in value from the magnitude of the displacement
$|\Delta\mathbf{r}|$. If we take increments of the distance $\Delta s$ and the
displacement $\Delta\mathbf{r}$ corresponding to smaller and smaller time intervals
$\Delta t$, then the difference between $\Delta s$ and $|\Delta\mathbf{r}|$ will diminish,
and their ratio in the limit will become equal to unity:

$$\lim_{\Delta t \to 0}\frac{\Delta s}{|\Delta\mathbf{r}|} = 1$$

On these grounds, we can substitute $\Delta s$ for $|\Delta\mathbf{r}|$ in formula (1.59),
which gives us the expression

$$v = \lim_{\Delta t \to 0}\frac{\Delta s}{\Delta t} = \frac{ds}{dt} \tag{1.60}$$

Thus, the magnitude of the velocity equals the derivative of the
distance with respect to time.

* The symbol $\Delta$ (delta) is used in two cases:

(a) for designating the increment of a quantity. In the case being
considered, $\Delta\mathbf{r}$ is the increment of the position vector r during the
time $\Delta t$;

(b) for designating a fraction of a quantity. For example, $\Delta t$ is a fraction
of the total time t during which motion occurs, and $\Delta s$ is a fraction of the
entire distance s travelled by the particle.

It is evident that the quantity which in everyday life we call
the speed is actually the magnitude of the velocity v. In uniform
motion, the magnitude of the velocity remains constant (v = const),
whereas the direction of the vector v may change arbitrarily (in
particular, it may be constant).

In accordance with Eq. (1.57), the elementary displacement of 
a particle is 

dr = v dt (1.61) 


Sometimes for clarity's sake, we shall denote an elementary
displacement by the symbol $d\mathbf{s}$, i.e. write Eq. (1.61) in the form

$$d\mathbf{s} = \mathbf{v}\,dt \tag{1.62}$$

The velocity vector, like any other vector, can be represented
in the form

$$\mathbf{v} = v_x\mathbf{e}_x + v_y\mathbf{e}_y + v_z\mathbf{e}_z \tag{1.63}$$

where $v_x$, $v_y$, $v_z$ are the projections of the vector v onto the
coordinate axes. At the same time, the vector $\dot{\mathbf{r}}$ equal to v,
according to Eq. (1.43), can be written as follows:

$$\dot{\mathbf{r}} = \dot{x}\,\mathbf{e}_x + \dot{y}\,\mathbf{e}_y + \dot{z}\,\mathbf{e}_z \tag{1.64}$$

It follows from a comparison of Eqs. (1.63) and (1.64) that

$$v_x = \dot{x}, \quad v_y = \dot{y}, \quad v_z = \dot{z} \tag{1.65}$$

Consequently, the projection of the velocity vector onto a
coordinate axis equals the time derivative of the relevant coordinate of
the moving particle. Taking Eq. (1.10) into account, we get:

$$v = \sqrt{\dot{x}^2 + \dot{y}^2 + \dot{z}^2} \tag{1.66}$$
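
Equations (1.65) and (1.66) can be tried out on a concrete motion. The sketch below assumes, purely for illustration, a particle moving along a helix with hypothetical parameters `R`, `OMEGA`, and `U`; the projections of the velocity are estimated by central differences of the coordinates.

```python
import math

# Hypothetical parameters of a helical motion (illustrative values).
R, OMEGA, U = 2.0, 3.0, 1.5

def r(t):
    """Position vector of the particle, given by its coordinates x, y, z."""
    return (R * math.cos(OMEGA * t), R * math.sin(OMEGA * t), U * t)

def velocity(t, h=1e-6):
    """Projections v_x, v_y, v_z as time derivatives of x, y, z, Eq. (1.65)."""
    lo, hi = r(t - h), r(t + h)
    return tuple((q - p) / (2 * h) for p, q in zip(lo, hi))

def speed(t):
    """Magnitude of the velocity, Eq. (1.66)."""
    return math.sqrt(sum(c * c for c in velocity(t)))

print(velocity(0.4))
print(speed(0.4), math.hypot(R * OMEGA, U))  # numerical and exact values
```

For this motion the direction of the velocity changes continuously while its magnitude stays constant, so the motion is uniform in the sense used above.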

The velocity vector can be written in the form $\mathbf{v} = v\,\mathbf{e}_v$, where v
is the magnitude of the velocity, and $\mathbf{e}_v$ is the unit vector of v.
Let us introduce the unit vector $\boldsymbol{\tau}$ of the tangent to a trajectory
with its sense the same as that of v. Hence, obviously, the unit
vectors $\mathbf{e}_v$ and $\boldsymbol{\tau}$ will coincide, and we can write the following
expression:

$$\mathbf{v} = v\,\mathbf{e}_v = v\,\boldsymbol{\tau} \tag{1.67}$$

Let us obtain still another expression for v. For this purpose,
we shall introduce the position vector in the form of $\mathbf{r} = r\,\mathbf{e}_r$
into Eq. (1.57). According to Eq. (1.49), we have

$$\mathbf{v} = \dot{\mathbf{r}} = \dot{r}\,\mathbf{e}_r + r\,\dot{\mathbf{e}}_r \tag{1.68}$$

We shall limit ourselves, for simplicity, to the case when the
trajectory is a plane curve, i.e. a curve such that all its points are
in a single plane. Let this plane be the plane x, y. In Eq. (1.68), the
vector v is written in the form of two components (Fig. 1.25). The
first of them, which we shall designate $\mathbf{v}_r$, is

$$\mathbf{v}_r = \dot{r}\,\mathbf{e}_r \tag{1.69}$$

It is directed along the position vector r and characterizes the rate
of change of the magnitude of r. The second component, which we
shall designate $\mathbf{v}_\varphi$, is

$$\mathbf{v}_\varphi = r\,\dot{\mathbf{e}}_r \tag{1.70}$$

It characterizes the rate of change of the direction of the position
vector.

Using Eq. (1.56), we can write that

$$\dot{\mathbf{e}}_r = \frac{d\varphi}{dt}\,\mathbf{e}_\varphi = \dot{\varphi}\,\mathbf{e}_\varphi$$

where $\varphi$ is the angle between the position vector and the x-axis,
and $\mathbf{e}_\varphi$ is a unit vector perpendicular to the position vector with
its sense in the direction of growth of the angle $\varphi$ [in Eq. (1.56)
the symbol $\mathbf{e}_\perp$ was used for this unit vector]. Using this value in
Eq. (1.70), we get:

$$\mathbf{v}_\varphi = r\dot{\varphi}\,\mathbf{e}_\varphi \tag{1.71}$$

We have introduced the symbols $\mathbf{v}_\varphi$ and $\mathbf{e}_\varphi$ to underline the
fact that the component $\mathbf{v}_\varphi$ and the corresponding unit vector are
related to a change in the angle $\varphi$.

The vectors $\mathbf{v}_r$ and $\mathbf{v}_\varphi$ are obviously mutually perpendicular.
Hence,

$$v = \sqrt{v_r^2 + v_\varphi^2} = \sqrt{\dot{r}^2 + r^2\dot{\varphi}^2} \tag{1.72}$$
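
Formula (1.72) can be compared with the Cartesian result (1.66) for a plane motion. In the sketch below the laws `r(t)` and `phi(t)` describe a hypothetical spiral chosen for illustration; the speed is found once from numerically differentiated Cartesian coordinates and once from the radial and angular rates.

```python
import math

# A hypothetical plane motion along a spiral (illustrative laws).
def r(t):       return 1.0 + t
def r_dot(t):   return 1.0
def phi(t):     return 2.0 * t
def phi_dot(t): return 2.0

def xy(t):
    """Cartesian coordinates of the particle."""
    return (r(t) * math.cos(phi(t)), r(t) * math.sin(phi(t)))

def speed_cartesian(t, h=1e-6):
    """Speed from Eq. (1.66), with the derivatives estimated numerically."""
    (x0, y0), (x1, y1) = xy(t - h), xy(t + h)
    return math.hypot((x1 - x0) / (2 * h), (y1 - y0) / (2 * h))

def speed_polar(t):
    """Speed from Eq. (1.72): the root of r_dot^2 + r^2 * phi_dot^2."""
    return math.sqrt(r_dot(t) ** 2 + (r(t) * phi_dot(t)) ** 2)

t = 0.5
print(speed_cartesian(t), speed_polar(t))
```

The two values agree to within the accuracy of the numerical differentiation, since both describe the magnitude of the same vector v.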

Now let us consider how to calculate the distance travelled by
a particle from the moment of time $t_1$ to $t_2$ if we know the speed
at each moment. Let us divide the interval $t_2 - t_1$ into $N$ small,
but not necessarily equal intervals: $\Delta t_1, \Delta t_2, \ldots, \Delta t_N$. The total
distance $s$ travelled by a particle can be represented as the sum of
the distances $\Delta s_1, \Delta s_2, \ldots, \Delta s_N$ travelled during the relevant time
intervals $\Delta t$:

$$s = \Delta s_1 + \Delta s_2 + \ldots + \Delta s_N = \sum_{i=1}^{N} \Delta s_i$$

In accordance with formula (1.60), each of the addends can approximately
be represented in the form

$$\Delta s_i \approx v_i\,\Delta t_i$$

where $\Delta t_i$ is the time interval during which the distance $\Delta s_i$ was
travelled, and $v_i$ is one of the values of the speed during the time
$\Delta t_i$. Hence,

$$s \approx \sum_{i=1}^{N} v_i\,\Delta t_i \tag{1.73}$$

This expression will be obeyed more accurately with diminishing
time intervals $\Delta t_i$. In the limit when all the $\Delta t_i$'s tend to zero (the
number of intervals $\Delta t_i$ will correspondingly grow unlimitedly),
the approximate equation will become accurate:

$$s = \lim_{\Delta t_i \to 0} \sum_{i=1}^{N} v_i\,\Delta t_i$$

This expression is a definite integral of the function $v(t)$ taken
within the limits from $t_1$ to $t_2$. Thus, the distance travelled by
a particle during the interval from $t_1$ to $t_2$ is

$$s = \int_{t_1}^{t_2} v(t)\,dt \tag{1.74}$$

It must be underlined that here we are speaking of the speed. If we
take an integral of the velocity $\mathbf{v}(t)$, we get the vector of the displacement
of the particle from the point where it was at the moment $t_1$
to the point where it was at the moment $t_2$:

$$\int_{t_1}^{t_2} \mathbf{v}(t)\,dt = \int_{1}^{2} d\mathbf{r} = \mathbf{r}_{12} \tag{1.75}$$

[see Eq. (1.61)].

If we plot the dependence of $v$ on $t$ (Fig. 1.26), then the distance
travelled can be represented as the area of the figure confined between
the curve $v(t)$, the straight lines $t = t_1$ and $t = t_2$, and the
$t$-axis. Indeed, the product $v_i\,\Delta t_i$ numerically equals the area of
the $i$-th strip. The sum (1.73) equals the area of the figure confined on
top by the broken line formed by the top edges of all such strips.
When all the $\Delta t_i$'s tend to zero, the width of a strip diminishes
(their number grows simultaneously), and the broken line will
coincide with the curve $v = v(t)$ in the limit. Thus, the distance
travelled during the time from the moment $t_1$ to the moment $t_2$
numerically equals the area confined between the curve of the function
$v = v(t)$, the time axis, and the straight lines $t = t_1$ and $t = t_2$.
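The difference between integrating the speed [Eq. (1.74)] and integrating the velocity [Eq. (1.75)] can be seen in a short numerical sketch. The example below is an illustration under stated assumptions: the particle is taken to move on a unit circle with an angular speed of 1 rad/s, and the helper names `velocity` and `integrate_motion` are hypothetical. The sums are of the form (1.73), with the speed sampled at the middle of each interval.

```python
import math

def velocity(t):
    """Velocity of a hypothetical particle on a unit circle (1 rad/s)."""
    return (-math.sin(t), math.cos(t))

def integrate_motion(t1, t2, n=100_000):
    """Sum v_i * dt_i for the speed (distance) and for the velocity (displacement)."""
    dt = (t2 - t1) / n
    s = 0.0
    dx = dy = 0.0
    for i in range(n):
        vx, vy = velocity(t1 + (i + 0.5) * dt)  # value at the middle of the strip
        s += math.hypot(vx, vy) * dt            # speed times dt, Eq. (1.73)
        dx += vx * dt                           # components of the displacement
        dy += vy * dt
    return s, (dx, dy)

# Half a revolution: the path is a semicircle, the displacement is a diameter
s, r12 = integrate_motion(0.0, math.pi)
```

Over half a revolution the distance travelled is close to $\pi$ (the length of the semicircle), whereas the magnitude of the displacement is close to 2 (the diameter).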

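It should be noted that the average value of the speed during
the time from $t_1$ to $t_2$, by definition, is

$$\langle v \rangle = \frac{s}{t_2 - t_1}$$

(The symbol $\langle\ \rangle$ embracing the $v$ indicates an average.) Introducing
into this equation the expression (1.74) for $s$, we get

$$\langle v \rangle = \frac{1}{t_2 - t_1} \int_{t_1}^{t_2} v(t)\,dt \tag{1.76}$$

The average values of any scalar or vector functions are calculated
in a similar way. For example, the average value of the velocity is

$$\langle \mathbf{v} \rangle = \frac{1}{t_2 - t_1} \int_{t_1}^{t_2} \mathbf{v}(t)\,dt = \frac{\mathbf{r}_{12}}{t_2 - t_1} \tag{1.77}$$

[see Eq. (1.75)]. The average value of the function $y(x)$ within
the interval from $x_1$ to $x_2$ is determined by the expression

$$\langle y \rangle = \frac{1}{x_2 - x_1} \int_{x_1}^{x_2} y(x)\,dx \tag{1.78}$$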
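The definition of an average value can be tried out on a simple case. The sketch below is hypothetical: the helper `mean_value` and the choice of uniformly accelerated motion from rest, $v = gt$ with an assumed $g = 9.81$ m/s², are introduced only for illustration. For a speed growing linearly from zero, the average over the interval is half the final speed.

```python
def mean_value(f, x1, x2, n=10_000):
    """Approximate <f> = (1/(x2 - x1)) * integral of f from x1 to x2."""
    dx = (x2 - x1) / n
    total = sum(f(x1 + (i + 0.5) * dx) for i in range(n)) * dx  # midpoint sum
    return total / (x2 - x1)

g = 9.81  # m/s^2, assumed value of the acceleration

# Average speed over the first 2 s of motion with v = g t
avg_speed = mean_value(lambda t: g * t, 0.0, 2.0)
```

Here the final speed is $2g$, so the computed average should be close to $g$.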




1.4. Acceleration 


The velocity v of a particle can change with time both in magni- 
tude and in direction. The rate of change of the vector v, like the 
rate of change of any function of time, is determined by the deriv- 
ative of the vector v with respect to t. Denoting this derivative 
by the symbol $\mathbf{a}$, we get:

$$\mathbf{a} = \lim_{\Delta t \to 0} \frac{\Delta\mathbf{v}}{\Delta t} = \frac{d\mathbf{v}}{dt} = \dot{\mathbf{v}} \tag{1.79}$$


The quantity determined by formula (1.79) is called the accelera- 
tion of the particle. 

It must be noted that the acceleration a plays the same part with 
respect to v as the vector v does with respect to the position vector r. 

Equal vectors have identical projections onto the coordinate axes. 
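Consequently, for example,

$$\left(\frac{d\mathbf{v}}{dt}\right)_{\text{pr. } x} = \frac{dv_x}{dt}$$

[see Eqs. (1.40)]. At the same time, according to Eqs. (1.65), we
have $v_x = \dot{x} = dx/dt$. Therefore,

$$\frac{dv_x}{dt} = \frac{d}{dt}\left(\frac{dx}{dt}\right) = \frac{d^2x}{dt^2} = \ddot{x}$$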

What we have obtained is that the projection of the acceleration
vector onto the $x$-axis equals the second derivative of the coordinate
$x$ with respect to time: $a_x = \ddot{x}$. Similar expressions are obtained
for the projections of the acceleration onto the $y$- and $z$-axes. Thus,

$$a_x = \ddot{x}, \quad a_y = \ddot{y}, \quad a_z = \ddot{z} \tag{1.80}$$

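Using Eq. (1.67) for $\mathbf{v}$ in (1.79), we get:

$$\mathbf{a} = \frac{d}{dt}(v\boldsymbol{\tau}) \tag{1.81}$$

We remind our reader that $\boldsymbol{\tau}$ is the unit vector of a tangent to a trajectory
having the same direction as $\mathbf{v}$. According to Eq. (1.49),

$$\mathbf{a} = \dot{v}\boldsymbol{\tau} + v\dot{\boldsymbol{\tau}} \tag{1.82}$$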

Hence, the vector $\mathbf{a}$ can be represented in the form of the sum of two
components. One of them has the direction $\boldsymbol{\tau}$, i.e. is tangent to the
trajectory. It is therefore designated $\mathbf{a}_\tau$ and is called the tangential
acceleration. It equals

$$\mathbf{a}_\tau = \dot{v}\boldsymbol{\tau} \tag{1.83}$$

The second component equal to $v\dot{\boldsymbol{\tau}}$ is directed, as we shall show
below, along a normal to the trajectory. It is therefore designated
$\mathbf{a}_n$ and is called the normal acceleration. Thus,

$$\mathbf{a}_n = v\dot{\boldsymbol{\tau}} \tag{1.84}$$

In studying the properties of the two components, we shall restrict 
ourselves for the sake of simplicity to the case when the trajectory 
is a plane curve. 

The magnitude of the tangential acceleration (1.83) is

$$a_\tau = |\dot{v}| \tag{1.85}$$

If $\dot{v} > 0$ (the velocity grows in magnitude), then the vector $\mathbf{a}_\tau$
has the same direction as $\boldsymbol{\tau}$ (i.e. the same direction as $\mathbf{v}$). If $\dot{v} < 0$
(the velocity decreases with time), then the vectors $\mathbf{v}$ and $\mathbf{a}_\tau$ have
opposite directions. In uniform motion, $\dot{v} = 0$, and, therefore,
tangential acceleration is absent.

To determine the properties of the normal acceleration [Eq. (1.84)], we
must find out what $\dot{\boldsymbol{\tau}}$, i.e. the rate of change with time of the direction of
a tangent to the trajectory, is determined by. It is easy to understand that
this rate will grow with an increasing curvature of the trajectory and a higher
velocity of a particle along it.

The degree of bending of a plane curve is characterized by its curvature
$C$ determined by the expression

$$C = \lim_{\Delta s \to 0} \frac{\Delta\varphi}{\Delta s} = \frac{d\varphi}{ds} \tag{1.86}$$

where $\Delta\varphi$ is the angle between tangents to the curve at points spaced
$\Delta s$ apart (Fig. 1.27). Thus, the curvature determines the rate
of turning of a tangent in motion along a curve.

Fig. 1.27

The reciprocal of the curvature $C$ is called the radius of curvature
at the given point of the curve and is designated $R$:

$$R = \frac{1}{C} = \lim_{\Delta\varphi \to 0} \frac{\Delta s}{\Delta\varphi} = \frac{ds}{d\varphi} \tag{1.87}$$


The radius of curvature is the radius of a circle that coincides at the
given spot with the curve on an infinitely small portion of it. The
centre of this circle is defined as the centre of curvature for the
given point of the curve.

The radius and centre of curvature at point 1 (see Fig. 1.27) can
be determined as follows. Take point 1' near point 1. Draw the
tangents $\boldsymbol{\tau}$ and $\boldsymbol{\tau}'$ at these points. The perpendiculars to the tangents
will intersect at a certain point $O'$. We must note that for
a curve which is not a circle the distances $R'$ and $R''$ will differ
somewhat from each other. If point 1' is brought closer to point 1,
the point of intersection $O'$ of the perpendiculars will move along
the straight line $R'$ and in the limit will be at point $O$. It is exactly
the latter that will be the centre of curvature for point 1. The distances
$R'$ and $R''$ will tend to a common limit $R$ equal to the radius
of curvature. Indeed, if points 1 and 1' are close to each other, we
can write that $\Delta\varphi \approx \Delta s/R'$ or $R' \approx \Delta s/\Delta\varphi$. In the limit when
$\Delta\varphi \to 0$, this approximate equation will transform into the strict
equation $R = ds/d\varphi$ coinciding with the definition of the radius
of curvature [see Eq. (1.87)].
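The limiting procedure just described can be imitated numerically. The following sketch rests on assumptions of its own: the curve is given as $y = f(x)$ with a known derivative, the short arc between two neighbouring points is replaced by its chord, and the function name `radius_of_curvature` is hypothetical. The ratio $\Delta s/\Delta\varphi$ is then formed for two points close to the one of interest.

```python
import math

def radius_of_curvature(f, df, x, h=1e-4):
    """Estimate R = ds/dphi, Eq. (1.87), for the plane curve y = f(x) near x.

    Assumes the tangent actually turns between the two points (dphi != 0).
    """
    x1, x2 = x - h, x + h
    ds = math.hypot(x2 - x1, f(x2) - f(x1))        # short arc taken as its chord
    dphi = math.atan(df(x2)) - math.atan(df(x1))   # angle between the two tangents
    return ds / abs(dphi)

# Parabola y = x^2 at its vertex; the known radius of curvature there is 1/2
R_vertex = radius_of_curvature(lambda x: x * x, lambda x: 2.0 * x, 0.0)
```

For the parabola $y = x^2$ the estimate at the vertex comes out close to 0.5, and it approaches this value more closely as the two points are brought together.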

Let us now turn to the calculation of $\mathbf{a}_n$ [see Eq. (1.84)]. According
to Eq. (1.56),

$$\dot{\boldsymbol{\tau}} = \frac{d\varphi}{dt}\,\mathbf{n} \tag{1.88}$$

where $\mathbf{n}$ is the unit vector of the normal to the trajectory with its
sense in the direction of rotation of the vector $\boldsymbol{\tau}$ when a particle
travels along the trajectory [in Eq. (1.56) a similar unit vector was
designated $\mathbf{e}_\perp$]. The quantity $d\varphi/dt$ can be related to the radius of
curvature of the trajectory and the speed of the particle $v$. It follows
from Fig. 1.27 that

$$\Delta\varphi \approx \frac{\Delta s}{R'} = \frac{v'\,\Delta t}{R'}$$

where $\Delta\varphi$ = angle of rotation of the vector $\boldsymbol{\tau}$ during the time $\Delta t$
(coinciding with the angle between the perpendiculars $R'$ and $R''$)

$v'$ = average speed over the distance $\Delta s$.

Hence,

$$\frac{\Delta\varphi}{\Delta t} \approx \frac{v'}{R'}$$

In the limit when $\Delta t$ tends to zero, the approximate equation will
become a strict one, the average speed $v'$ will transform into the
instantaneous speed $v$ at point 1, and $R'$ will become the radius
of curvature $R$. As a result, we get the equation

$$\frac{d\varphi}{dt} = \frac{v}{R} = Cv \tag{1.89}$$

($C$ is the curvature). Hence, the rate of rotation of the velocity
vector, as we assumed, is proportional to the curvature of the trajectory
and the speed of a particle along its trajectory.

Using Eq. (1.89) in (1.88), we find that $\dot{\boldsymbol{\tau}} = (v/R)\,\mathbf{n}$. And at last,
introducing this expression into Eq. (1.84), we arrive at the final
formula for the normal acceleration:

$$\mathbf{a}_n = \frac{v^2}{R}\,\mathbf{n} \tag{1.90}$$

Thus, the acceleration vector when a particle travels along a plane
curve is determined by the following expression:

$$\mathbf{a} = \mathbf{a}_\tau + \mathbf{a}_n = \dot{v}\boldsymbol{\tau} + \frac{v^2}{R}\,\mathbf{n} \tag{1.91}$$

The magnitude of the vector $\mathbf{a}$ is

$$a = \sqrt{a_\tau^2 + a_n^2} = \sqrt{\dot{v}^2 + \left(\frac{v^2}{R}\right)^2} \tag{1.92}$$
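Equations (1.90)-(1.92) can be illustrated by splitting a known acceleration into its two components. The sketch below is an assumption-laden example rather than part of the derivation: it uses the identities $\dot{v} = (\mathbf{v}\cdot\mathbf{a})/v$ and $a_n = |v_x a_y - v_y a_x|/v$, which follow from projecting $\mathbf{a}$ onto $\boldsymbol{\tau}$ and onto the normal, and it takes a hypothetical body moving freely under gravity with $g = 9.81$ m/s².

```python
import math

def decompose_acceleration(v, a):
    """Split a plane acceleration a into tangential and normal parts relative to v."""
    speed = math.hypot(*v)
    a_tau = (v[0] * a[0] + v[1] * a[1]) / speed    # signed rate of change of the speed
    a_n = abs(v[0] * a[1] - v[1] * a[0]) / speed   # magnitude of the normal part
    return a_tau, a_n

g = 9.81            # m/s^2, assumed
v = (3.0, 4.0)      # hypothetical instantaneous velocity, m/s (speed 5 m/s)
a = (0.0, -g)       # acceleration of a freely moving body

a_tau, a_n = decompose_acceleration(v, a)
R = (v[0] ** 2 + v[1] ** 2) / a_n   # radius of curvature from a_n = v^2 / R
```

The tangential component is negative here because the body is rising and its speed is diminishing, and the two components recombine according to Eq. (1.92) into the full magnitude $g$.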

In rectilinear motion, the normal acceleration is absent. It must
be noted that $\mathbf{a}_n$ vanishes at the inflection point of a curvilinear
trajectory (at point IP in Fig. 1.28).
At both sides of this point, the vectors $\mathbf{a}_n$ have different directions. The
vector $\mathbf{a}_n$ cannot change in a jump. Its
direction reverses smoothly, and it
becomes equal to zero at the inflection
point.

Assume that a particle is travelling
uniformly with an acceleration constant in magnitude.
Since in uniform motion the magnitude of
the velocity does not change, we have $a_\tau = 0$, so that $a = a_n$.
The constant magnitude of $a_n$ signifies that $v^2/R = \text{const}$. Hence,
we conclude that $R = \text{const}$ ($v = \text{const}$ because the motion is
uniform). This means that the particle is travelling along a curve
of constant curvature, i.e. a circle. Thus, when the acceleration of
a particle is constant in magnitude and is directed at each moment
of time at right angles to the velocity vector, the trajectory of the
particle will be a circle.



Fig. 1.28 


1.5. Circular Motion 

The rotation of a body through a certain angle $\varphi$ can be given
in the form of a straight line segment whose length is $\varphi$ and whose direction
coincides with the axis about which the body is rotating. To indicate
the direction of rotation about a given axis, it is related to the line
depicting rotation by the right-hand screw rule: the line should
be directed so that when looking along it (Fig. 1.29) we see clockwise
rotation (when rotating the head of a right-hand screw clockwise,
we cause it to move away from us). We showed in Sec. 1.2 (see
Fig. 1.4) that rotations through finite angles are not added by the
parallelogram method and are therefore not vectors. Matters are
different for rotations through very small angles $\Delta\varphi$. The distance
travelled by any point of a body when rotated through a very small
angle can be considered as a straight line (Fig. 1.30). Consequently,
two small circular motions $\Delta\boldsymbol{\varphi}_1$ and $\Delta\boldsymbol{\varphi}_2$ performed sequentially,
as can be seen from the figure, result in the same displacement
$\Delta\mathbf{r}_3 = \Delta\mathbf{r}_1 + \Delta\mathbf{r}_2$ of any point of the body as the circular motion
$\Delta\boldsymbol{\varphi}_3$ obtained from $\Delta\boldsymbol{\varphi}_1$ and $\Delta\boldsymbol{\varphi}_2$ by addition using the parallelogram
method. Hence it follows that very small circular motions can be
considered as vectors (we shall denote these vectors by $\Delta\boldsymbol{\varphi}$ or $d\boldsymbol{\varphi}$).
The direction of the rotation vector is associated with the direction
of rotation of a body. Consequently, $d\boldsymbol{\varphi}$ is not a true vector, but
a pseudovector.

Fig. 1.29

Fig. 1.30

The vector quantity

$$\boldsymbol{\omega} = \lim_{\Delta t \to 0} \frac{\Delta\boldsymbol{\varphi}}{\Delta t} = \frac{d\boldsymbol{\varphi}}{dt} \tag{1.93}$$

(where $\Delta t$ is the time during which the circular motion $\Delta\boldsymbol{\varphi}$ is performed)
is called the angular velocity of a body*. The angular
velocity $\boldsymbol{\omega}$ is directed along the axis about which the body is rotating
in a direction determined by the right-hand screw rule (Fig. 1.31)
and is a pseudovector. The magnitude of the angular velocity, i.e.
the angular speed, equals $d\varphi/dt$. Circular motion at a constant
angular velocity is called uniform. For uniform circular motion,
we have $\omega = \varphi/t$, where $\varphi$ is the finite angle of rotation during
the time $t$ (compare with $v = s/t$). Thus, in uniform circular motion,
$\omega$ shows the angle through which a body rotates in unit time.

* The velocity $\mathbf{v}$ considered in Sec. 1.3 is sometimes called linear.

Fig. 1.31

Fig. 1.32

Uniform circular motion can be characterized by the period of
revolution $T$. It is defined as the time during which a body completes
one revolution, i.e. rotates through the angle $2\pi$ radians, or
360 degrees. Since the time interval $\Delta t = T$ corresponds to the
angle of rotation $\Delta\varphi = 2\pi$, we have

$$\omega = \frac{2\pi}{T} \tag{1.94}$$

whence

$$T = \frac{2\pi}{\omega} \tag{1.95}$$

The number of revolutions in unit time $\nu$ is evidently equal to

$$\nu = \frac{1}{T} = \frac{\omega}{2\pi} \tag{1.96}$$

It follows from Eq. (1.96) that the angular velocity equals $2\pi$ multiplied
by the number of revolutions per unit time:

$$\omega = 2\pi\nu \tag{1.97}$$
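The relations (1.94)-(1.97) amount to simple conversions between $T$, $\nu$ and $\omega$. A minimal sketch follows; the function names and the sample period of 0.02 s are hypothetical and serve only as an illustration.

```python
import math

def angular_velocity_from_period(T):
    """Angular speed in rad/s from the period of revolution T, Eq. (1.94)."""
    return 2.0 * math.pi / T

def revolutions_per_unit_time(omega):
    """Number of revolutions per unit time from the angular speed, Eq. (1.96)."""
    return omega / (2.0 * math.pi)

T = 0.02                                   # s, hypothetical period
omega = angular_velocity_from_period(T)    # rad/s
nu = revolutions_per_unit_time(omega)      # revolutions per second
```

With this period the body makes about 50 revolutions per second, and the values of $\omega$ and $\nu$ satisfy Eq. (1.97).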

The concepts of the period of revolution and the number of revolutions
per unit time can also be retained for non-uniform circular
motion. Here, we must understand the instantaneous value of $T$
to signify the time during which a body would perform one revolution
if it rotated uniformly with the given instantaneous value of
the angular velocity, and $\nu$ to signify the number of revolutions
which a body would complete in unit time in similar conditions.

The vector $\boldsymbol{\omega}$ may vary either as a result of a change in the speed
of rotation of a body about its axis (in this case it changes in magnitude),
or as a result of turning of the axis of rotation in space (in
this case $\boldsymbol{\omega}$ changes in direction). Assume that during the time $\Delta t$
the vector $\boldsymbol{\omega}$ receives the increment $\Delta\boldsymbol{\omega}$. The change in the angular
velocity vector with time is characterized by
the quantity

$$\boldsymbol{\alpha} = \lim_{\Delta t \to 0} \frac{\Delta\boldsymbol{\omega}}{\Delta t} = \frac{d\boldsymbol{\omega}}{dt} \tag{1.98}$$

called the angular acceleration. The latter, like
the angular velocity, is a pseudovector.

Different points of a body in circular motion
have different linear velocities $\mathbf{v}$. The velocity
of each point continuously changes its direction.
The speed $v$ is determined by the speed
of rotation of the body $\omega$ and the distance $R$
to the point being considered from the axis
of rotation. Assume that during a small interval
of time the body turned through the angle $\Delta\varphi$ (Fig. 1.32).
The point at the distance $R$ from the axis travels the path
$\Delta s = R\,\Delta\varphi$. The linear speed of the point is

$$v = \lim_{\Delta t \to 0} \frac{\Delta s}{\Delta t} = \lim_{\Delta t \to 0} R\frac{\Delta\varphi}{\Delta t} = R \lim_{\Delta t \to 0} \frac{\Delta\varphi}{\Delta t} = R\frac{d\varphi}{dt} = R\omega$$

Thus,

$$v = \omega R \tag{1.99}$$


Equation (1.99) relates the linear and the angular speeds. Let
us find an expression relating the relevant velocities $\mathbf{v}$ and $\boldsymbol{\omega}$. We
shall determine the position of the point of the body being considered
by the position vector $\mathbf{r}$ drawn from the origin of coordinates
on the axis of rotation (Fig. 1.33). Examination of the figure shows
that the vector product $[\boldsymbol{\omega}\mathbf{r}]$ coincides in direction with the vector $\mathbf{v}$
and its magnitude is $\omega r \sin\alpha = \omega R$. Hence,

$$\mathbf{v} = [\boldsymbol{\omega}\mathbf{r}] \tag{1.100}$$
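Equation (1.100) is easy to verify for a concrete point. The sketch below is illustrative only: the helper `cross` and the numerical values (rotation about the $z$-axis at 2 rad/s, a point at $\mathbf{r} = (3, 0, 4)$ with the origin on the axis) are assumptions. The magnitude of the vector product is compared with $\omega R$, where $R$ is the distance of the point from the axis.

```python
import math

def cross(a, b):
    """Vector product [ab] of two three-dimensional vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

omega = (0.0, 0.0, 2.0)   # rad/s, rotation about the z-axis (hypothetical)
r = (3.0, 0.0, 4.0)       # m, position vector drawn from a point on the axis

v = cross(omega, r)           # linear velocity, Eq. (1.100)
R = math.hypot(r[0], r[1])    # distance of the point from the axis of rotation
speed = math.sqrt(sum(c * c for c in v))
```

The resulting velocity is perpendicular both to the axis and to $\mathbf{r}$, and its magnitude equals $\omega R$ in agreement with Eq. (1.99); the component of $\mathbf{r}$ along the axis has no effect.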


The normal acceleration of the points of a rotating body is
$a_n = v^2/R$. Introducing into this expression the value of $v$ from
Eq. (1.99), we get

$$a_n = \omega^2 R \tag{1.101}$$

If we introduce the vector $\mathbf{R}$ drawn to the given point of the body
from the axis of rotation at right angles to the latter (see Fig. 1.33),
then Eq. (1.101) can be given a vector form:

$$\mathbf{a}_n = -\omega^2\mathbf{R} \tag{1.102}$$

There is a minus sign in this formula because the vectors $\mathbf{a}_n$ and $\mathbf{R}$
have opposite directions.

Let us assume that the axis of rotation of a body does not turn
in space. According to Eq. (1.85), the magnitude of the tangential
acceleration is $|dv/dt|$. Using equation (1.99) and taking into
account that the distance to the point being considered from the
axis of rotation $R = \text{const}$, we can write

$$\left|\lim_{\Delta t \to 0} \frac{\Delta v}{\Delta t}\right| = \left|\lim_{\Delta t \to 0} \frac{\Delta(\omega R)}{\Delta t}\right| = \left|\lim_{\Delta t \to 0} R\frac{\Delta\omega}{\Delta t}\right| = R\left|\lim_{\Delta t \to 0} \frac{\Delta\omega}{\Delta t}\right| = R\alpha$$

where $\alpha$ is the magnitude of the angular acceleration. Consequently,
the magnitude of the tangential acceleration is related to the magnitude
of the angular acceleration as follows:

$$a_\tau = \alpha R \tag{1.103}$$

Thus, the normal and tangential accelerations grow linearly with 
an increasing distance to a point from the axis of rotation. 
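The linear growth of both accelerations with the distance from the axis can be shown with a few numbers. This is a sketch under assumed values ($\omega = 2$ rad/s, $\alpha = 0.5$ rad/s²), and the function name `point_accelerations` is hypothetical.

```python
import math

def point_accelerations(omega, alpha, R):
    """Normal, tangential and total acceleration of a point at distance R from the axis."""
    a_n = omega ** 2 * R      # Eq. (1.101)
    a_tau = alpha * R         # Eq. (1.103)
    return a_n, a_tau, math.hypot(a_n, a_tau)

# The same body, two points at distances of 3 m and 6 m from the axis
a_n, a_tau, a_total = point_accelerations(omega=2.0, alpha=0.5, R=3.0)
a_n2, a_tau2, a_total2 = point_accelerations(omega=2.0, alpha=0.5, R=6.0)
```

Doubling the distance doubles both components and therefore also the magnitude of the total acceleration.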




CHAPTER 2 DYNAMICS OF A POINT PARTICLE


2.1. Classical Mechanics. Range of Its Applicability

Kinematics describes the motion of bodies without being con- 
cerned with why a body moves exactly in a given way (for example, 
uniformly along a circle, or with uniform acceleration along a straight 
line), and not in a different one. 

Dynamics studies the motion of bodies in connection with its 
causes (the interactions between bodies) resulting in the occurrence 
of a specific kind of motion. 

The so-called classical or Newtonian mechanics is based on three 
laws of dynamics that were formulated by Isaac Newton in 1687. 

Newton’s laws (like all other laws of physics) were the result of 
generalizing a great amount of experimental facts. Their correctness 
(although it covers a very extensive range of phenomena, the latter 
are nevertheless limited) is confirmed by the agreement of the corol- 
laries following from them with experimental results. 

Newtonian mechanics achieved such great successes during two
centuries that many physicists of the 19th century were convinced
of its omnipotence. It was considered that the explanation of any
physical phenomenon required its reduction to a mechanical process
obeying Newton’s laws. With the development of science, however,
new facts were uncovered for which no place could be found within
the confines of classical mechanics. These facts were explained in
new theories: the special theory of relativity and quantum mechanics.

The special theory of relativity advanced by Albert Einstein 
in 1905 radically revised Newton’s notions of space and time. This 
revision resulted in the creation of “high-speed mechanics” or, as 
it is called, relativistic mechanics. The new mechanics did not 
result, however, in complete negation of the old Newtonian mecha- 
nics. The equations of relativistic mechanics in their limit (for 
speeds small in comparison with the speed of light) transform into 
the equations of classical mechanics. Thus, classical mechanics has 
entered relativistic mechanics as a particular case of it and has 
retained its previous significance for describing motions occurring 
at speeds much smaller than that of light. 

Matters are similar with the relation between classical and quantum
mechanics. The latter took root in the 1920s
as a result of the development of physics of the atom. The
equations of quantum mechanics also result in those of classical
mechanics in their limit (for masses that are great in comparison
with the masses of atoms). Consequently, classical mechanics is
also a part of quantum mechanics and is a limiting case of it.

Thus, the development of science has not eliminated classical 
mechanics, but has only shown its limited applicability. Classical 
mechanics based on Newton’s laws is the mechanics for bodies of 
large (compared with the mass of atoms) masses travelling at low 
(compared with the speed of light) speeds. 

2.2. Newton’s First Law. Inertial Reference Frames

Newton’s first law is formulated as follows: every body continues
in its state of rest or of uniform motion in a straight line unless it is
compelled by external forces to change that state. Both states named
are distinguished by the acceleration of the body equalling zero.
Therefore, the first law can also be formulated as follows: the velocity
of every body remains constant (in particular, it equals zero)
until the action of other bodies on this body causes it to change.

Newton’s first law is not obeyed in every reference frame. We have
already noted that the nature of motion depends on the choice of 
the reference frame. Let us consider two frames of reference moving 
with respect to each other with a certain acceleration. If a body is 
at rest relative to one of them, then it will obviously travel with 
acceleration relative to the other one. Consequently, Newton’s first 
law cannot be obeyed simultaneously in both frames. 

A reference frame in which Newton’s first law is obeyed is called 
an inertial one. The law itself is quite often called the law of inertia. 
A reference frame in which Newton’s first law is not obeyed is called 
a non-inertial reference frame. There is an infinite multitude of 
inertial frames. Any reference frame moving uniformly in a straight 
line (i.e. with a constant velocity) relative to an inertial frame will 
also be an inertial one. This will be discussed in greater detail in 
Sec. 2.7. 

It has been established experimentally that the reference frame 
whose centre coincides with the Sun and whose axes are directed 
toward appropriately selected stars is an inertial one. This system 
is defined as a heliocentric reference frame (helios means Sun in
Greek). Any reference frame moving uniformly in a straight line 
relative to the heliocentric frame will be an inertial one. 

The Earth moves relative to the Sun and stars along a curvilinear 
trajectory having the shape of an ellipse. Curvilinear motion always 
occurs with a certain acceleration. The Earth also rotates about its 
axis. For these reasons, a reference frame associated with the Earth’s 
surface travels with acceleration relative to the heliocentric reference
frame, and is not inertial. The acceleration of such a frame,
however, is so small that it may be considered practically inertial 
in a great number of cases. But sometimes the non-inertial nature 
of a reference frame associated with the Earth significantly affects 
the nature of mechanical phenomena being considered relative to it. 
We shall treat some of these cases on a later page. 


2.3. Mass and Momentum of a Body 


The action of other bodies on a given one causes its velocity to 
change, i.e. imparts an acceleration to it. Experiments show that 
the same action imparts accelerations differing in magnitude to 
different bodies. Every body resists attempts to change its state 
of motion. This property of bodies is called inertia. It is charac- 
terized quantitatively by a physical quantity called the mass of 
a body. 

To find the mass of a body, we must compare it with that of the
body taken as the standard of mass. We can also compare the mass
of the given body with that of a body having a known mass (found
by comparing it with the standard). The masses $m_1$ and $m_2$ of two
point particles can be compared as follows. We place the particles
in conditions allowing us to ignore their interaction with other
bodies. A system of bodies interacting only with one another and
not interacting with other bodies is called isolated. We are therefore
considering an isolated system of two particles. If we make
these particles interact (for example, by colliding with each other),
their velocities receive the increments $\Delta\mathbf{v}_1$ and $\Delta\mathbf{v}_2$. Experiments
show that these increments are always directed oppositely, i.e. differ
in their sign. The ratio of the magnitudes of the velocity increments,
however, does not depend on the method and intensity of interaction
of the given two bodies or particles*. This ratio is inversely
proportional to the ratio of the masses of the bodies being considered:


$$\frac{|\Delta\mathbf{v}_1|}{|\Delta\mathbf{v}_2|} = \frac{m_2}{m_1} \tag{2.1}$$

(the velocity of the body with the greater inertia, i.e. with the
larger mass, changes less). Taking into account the relative direction
of the vectors $\Delta\mathbf{v}_1$ and $\Delta\mathbf{v}_2$, Eq. (2.1) can be written in the form

$$m_1\,\Delta\mathbf{v}_1 = -m_2\,\Delta\mathbf{v}_2 \tag{2.2}$$


* This holds for the case when the initial and final velocities of the particles
are small in comparison with the speed of light $c$.

In Newtonian mechanics (i.e. in mechanics based on Newton’s
laws), the mass of a body is assumed to be a constant quantity not
depending on its velocity. At velocities much smaller than the speed of
light $c$ (when $v \ll c$), this assumption is obeyed in practice. Taking
advantage of the constancy of mass, we can write Eq. (2.2) as follows:

$$\Delta(m_1\mathbf{v}_1) = -\Delta(m_2\mathbf{v}_2) \tag{2.3}$$

The product of the mass and the velocity of a body is called its
momentum. Using the symbol $\mathbf{p}$ for it, we get

$$\mathbf{p} = m\mathbf{v} \tag{2.4}$$

Definition (2.4) holds for point particles and extended bodies in
translational motion. When we have to do with an extended body
whose motion is not translational, we must imagine the body as
a combination of particles of masses $\Delta m_i$, determine the momenta
$\Delta m_i\mathbf{v}_i$ of these particles, and then add these momenta vectorially.
The result will be the total momentum of the body:

$$\mathbf{p} = \sum_i \Delta m_i\,\mathbf{v}_i \tag{2.5}$$

In translation of a body, all the $\mathbf{v}_i$'s are the same, and Eq. (2.5)
transforms into (2.4).

Substituting the momenta $\mathbf{p}$ for the products $m\mathbf{v}$ in Eq. (2.3),
we get $\Delta\mathbf{p}_1 = -\Delta\mathbf{p}_2$, whence $\Delta(\mathbf{p}_1 + \mathbf{p}_2) = 0$. When the increment
of a quantity equals zero, this signifies that the quantity itself
remains unchanged. We have thus arrived at the conclusion that
the total momentum of an isolated system of two interacting particles
remains constant:

$$\mathbf{p} = \mathbf{p}_1 + \mathbf{p}_2 = \text{const} \tag{2.6}$$


The above statement forms the law of conservation of momentum. 
We shall consider this law in greater detail in Sec. 3.10. 
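The comparison of masses by Eq. (2.1) and the constancy of the total momentum (2.6) can be followed on invented numbers. Everything in the sketch below is hypothetical: the masses, the velocities before and after the interaction (the particles are assumed to stick together), and the helper names `mass_ratio` and `total_momentum`.

```python
import math

def mass_ratio(dv1, dv2):
    """Return m2/m1 from the velocity increments of two interacting particles, Eq. (2.1)."""
    return math.hypot(*dv1) / math.hypot(*dv2)

def total_momentum(masses, velocities):
    """Vector sum of m_i * v_i for particles moving in a plane."""
    px = sum(m * v[0] for m, v in zip(masses, velocities))
    py = sum(m * v[1] for m, v in zip(masses, velocities))
    return (px, py)

# Hypothetical data: the two particles move together after the collision
m1, m2 = 2.0, 6.0                     # kg
before = [(4.0, 0.0), (0.0, 0.0)]     # m/s
after = [(1.0, 0.0), (1.0, 0.0)]      # m/s

dv1 = (after[0][0] - before[0][0], after[0][1] - before[0][1])
dv2 = (after[1][0] - before[1][0], after[1][1] - before[1][1])

ratio = mass_ratio(dv1, dv2)                  # should equal m2 / m1
p_before = total_momentum([m1, m2], before)
p_after = total_momentum([m1, m2], after)
```

The velocity of the lighter particle changes three times as much as that of the heavier one, which reproduces the assumed ratio $m_2/m_1 = 3$, and the total momentum is the same before and after the interaction.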

We must note here that in relativistic mechanics (see Chap. 8)
the expression for the momentum is more complicated than Eq. (2.4):

$$\mathbf{p} = \frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}} \tag{2.7}$$

Here $m$ is the so-called rest mass of a body (its mass at $v = 0$),
and $c$ is the speed of light in a vacuum. Equation (2.7) can be interpreted
to state that the mass of a body does not remain constant
(as is assumed in Newtonian mechanics), but changes with the
speed according to the law

$$m(v) = \frac{m}{\sqrt{1 - v^2/c^2}} \tag{2.8}$$

Hence, Eq. (2.7) can be written as follows:

$$\mathbf{p} = m(v)\,\mathbf{v} \tag{2.9}$$

i.e. in a form similar to Eq. (2.4).
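Equations (2.7)-(2.9) can be evaluated directly to see how small the correction is at ordinary speeds. The sketch below is only an illustration; the function names are hypothetical, the value of $c$ is the standard defined one, and the two test speeds ($0.6c$ and 30 m/s) are arbitrary choices.

```python
import math

C = 299_792_458.0  # speed of light in a vacuum, m/s

def relativistic_mass(m, v):
    """Speed-dependent mass m(v) of Eq. (2.8); m is the rest mass, v < C."""
    return m / math.sqrt(1.0 - (v / C) ** 2)

def relativistic_momentum(m, v):
    """Magnitude of the momentum according to Eq. (2.9)."""
    return relativistic_mass(m, v) * v

mass_fast = relativistic_mass(1.0, 0.6 * C)           # mass of a 1 kg body at 0.6 c
p_slow = relativistic_momentum(1.0, 30.0)             # momentum at 30 m/s
newtonian_p_slow = 1.0 * 30.0                         # Eq. (2.4) for comparison
```

At $0.6c$ the mass exceeds the rest mass by a quarter, whereas at 30 m/s the relativistic and Newtonian momenta agree to within the accuracy of the computation.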

The mass $m(v)$ determined by Eq. (2.8) is called the relativistic
mass. We shall designate it by the symbol $m_r$ in the following.


2.4. Newton’s Second Law

Newton’s second law states that the rate of change of the momentum
of a body equals the force $\mathbf{F}$ acting on the body:

$$\frac{d\mathbf{p}}{dt} = \mathbf{F} \tag{2.10}$$

Equation (2.10) is called the equation of motion of a body. 

Substituting $m\mathbf{v}$ for $\mathbf{p}$ according to Eq. (2.4) and taking into
account that in Newtonian mechanics the mass is assumed to be
constant, we can write Eq. (2.10) in the form

$$m\mathbf{a} = \mathbf{F} \tag{2.11}$$

where $\mathbf{a} = \dot{\mathbf{v}}$. We have thus arrived at a different formulation of
Newton’s second law: the product of the mass of a body and its acceleration
equals the force acting on the body.

Equation (2.11) has called forth and is continuing to call forth
many controversies among physicists. To date, there is no generally
adopted interpretation of this relation. The complication consists
in that there are no independent ways of determining the quantities $m$
and $\mathbf{F}$ in Eq. (2.11). To determine one of them ($m$ or $\mathbf{F}$), we have
to use Eq. (2.11) relating it to the other one and to the acceleration
$\mathbf{a}$. For example, according to S. Khaikin*, “Since to establish
a way of measuring the mass of a body we use the same second law
of Newton (the magnitude of the mass of a body is determined by
simultaneously measuring the force and the acceleration), then
Newton’s second law contains, on the one hand, a statement that
the acceleration is proportional to the force, and on the other,
a definition of the mass of a body as the ratio of the force acting
on it to the acceleration imparted to this body”.

R. Feynman states the following about the meaning of Newton’s
second law: “Let us ask, ‘What is the meaning of the physical laws
of Newton, which we write as F = ma? What is the meaning
of force, mass, and acceleration?’ Well, we can intuitively sense
the meaning of mass, and we can define acceleration if we know the
meaning of position and time. We shall not discuss these meanings,
but shall concentrate on the new concept of force. The answer is
equally simple: ‘If a body is accelerating, then there is a force on
it’. That is what Newton’s laws say, so the most precise and beautiful
definition of force imaginable might simply be to say that force
is the mass of an object times the acceleration...”. However, “if
we have discovered a fundamental law, which asserts that the force
is equal to the mass times the acceleration, and then define the force
to be the mass times the acceleration, we have found out nothing ...,
such things certainly cannot be the content of physics, because
they are definitions going in a circle... no prediction whatsoever
can be made from a definition ... The real content of Newton’s laws
is this: that the force is supposed to have some independent properties,
in addition to the law F = ma; but the specific independent
properties that the force has were not completely described by
Newton or by anybody else...”*.

* S. E. Khaikin. Fizicheskie osnovy mekhaniki (The Physical Fundamentals
of Mechanics). Moscow, Fizmatgiz (1963), p. 104.

We must underline the fact that Newton’s second law (like his 
other two laws) is an experimental one. It took shape as a result 
of generalization of the data of experiments and observations. 

In a particular case when F = 0 (i.e. in the absence of action 
on a body by other bodies), the acceleration, as follows from Eq. (2.11), 
also equals zero. This conclusion coincides with Newton’s first law. 
Therefore, the first law is contained in the second one as a particular 
case of it. Notwithstanding this circumstance, the first law is for- 
mulated independently of the second one because it contains in 
essence the postulate (statement) of the existence of inertial refer- 
ence frames. 

In conclusion, we shall note that upon an independent choice
of the units of mass, force, and acceleration, the second law must
be written in the form

$$m\mathbf{a} = k\mathbf{F} \tag{2.12}$$

where $k$ is a constant of proportionality.


2.5. Units and Dimensions of Physical Quantities

The laws of physics, as we have already noted, establish quanti- 
tative relations between physical quantities. To establish such 
relations, it is necessary to be able to measure various physical 
quantities. 

To measure a physical quantity (for example, speed) means to 
compare it with a quantity of the same kind (in our example with 
speed) taken as a unit. 

Generally speaking, we could establish a unit for every physical
quantity arbitrarily, regardless of other quantities. We can limit
ourselves, however, to an arbitrary choice of the units for only
a few (at least three) quantities taken as the basic ones. Any quantities
can be taken as the basic ones in principle. The units of all
other quantities can be established with the aid of these basic units
using for this purpose the physical laws relating the relevant quantity
to the basic ones or to quantities for which the units have already
been established in this way.

* R. P. Feynman, R. B. Leighton, M. Sands. The Feynman Lectures on
Physics, Reading, Mass., Addison-Wesley (1965), p. 12-1.

Let us consider the following example to explain what has been 
said above. Assume that we have already established the units for 
mass and acceleration. Equation (2.12) expresses the law relating
these quantities to a third physical quantity, force. We choose
the unit of force so that the proportionality constant in this
equation will equal unity. Equation (2.12) thus acquires a simpler form:

$m\mathbf{a} = \mathbf{F}$ (2.13)

It follows from Eq. (2.13) that the established unit of force is a
force such that a body of unit mass receives an acceleration of unity
under its action [substitution of F = 1 and m = 1 in Eq. (2.13)
gives a = 1].

When units are selected in this way, physical relations acquire 
a simpler form. The combination of units themselves forms a definite 
system. 

There are several systems differing in the selection of the basic 
units. Systems based on the units of length, mass, and time are 
called absolute. 


USSR State Standard GOST 9867-61 in force from January 1, 
1963, provides for the use of the International System of Units, 
designated SI, in the USSR. This system of units has been introduced
as preferable in all fields of science, engineering, and the national
economy, and also in education. The basic units of the SI system
include the unit of length, the metre (its symbol is m); the unit
of mass, the kilogramme (kg); and the unit of time, the second (s).
The SI system is thus an absolute one. In addition to the three units
indicated above, the other basic units of this system are the unit
of current, the ampere (A); the unit of thermodynamic temperature,
the kelvin (K); the unit of luminous intensity, the candela (cd);
and the unit of the amount of substance, the mole (mol).
These units will be treated in the corresponding parts of our course.

The metre is defined as the length equal to 1 650 763.73 wavelengths
in vacuum of the radiation corresponding to the transition
between the levels $2p_{10}$ and $5d_5$ of the krypton-86 atom* (the orange
line of krypton-86). The metre approximately equals 1/40 000 000th
of the length of an Earth’s meridian. Multiple and submultiple
units are also used, namely, the kilometre (1 km = $10^3$ m), the
centimetre (1 cm = $10^{-2}$ m), the millimetre (1 mm = $10^{-3}$ m),
the micrometre (1 μm = $10^{-6}$ m), etc.




* The meaning of these symbols will be explained in Vol. III in the part
“Atomic Physics”.

The kilogramme is the mass of a platinum and iridium* body
kept in the International Bureau of Weights and Measures at
Sèvres (near Paris). This body is called the international prototype
of the kilogramme. The mass of this prototype is close to that of
1000 cm$^3$ of pure water at 4 °C. A gramme equals one-thousandth
of a kilogramme.

The second is the duration of 9 192 631 770 periods of the radia- 
tion corresponding to the transition between the two hyperfine 
levels of the ground state of the cesium-133 atom. The second ap- 
proximately equals 1/86 400 of the mean solar day. 

Physics also employs an absolute system of units called the cgs 
system. The basic units in this system are the centimetre, gramme, 
and second. 

The units of the quantities that we introduced in kinematics 
(velocity and acceleration) are derived from the basic units. Thus, 
the unit of velocity (or speed) is the velocity of a body in uniform 
motion that travels a distance of unit length (metre or centimetre) 
in unit time (second). This unit is designated m/s in the SI system 
and cm/s in the cgs system. The unit of acceleration is the accelera- 
tion of uniformly varying motion when the velocity of a body changes 
by one unit (one m/s or cm/s) in unit time (second). This unit 
is designated m/s$^2$ in the SI system and cm/s$^2$ in the cgs system.

The unit of force in the SI system is called the newton (N). According
to Eq. (2.13), one newton equals the force that imparts an
acceleration of 1 m/s$^2$ to a body with a mass of 1 kg. The unit of force
in the cgs system is called the dyne (dyn). One dyne equals the
force that imparts an acceleration of 1 cm/s$^2$ to a body with a mass
of 1 g. The newton and dyne are related as follows:

$1\ \text{N} = 1\ \text{kg} \times 1\ \text{m/s}^2 = 10^3\ \text{g} \times 10^2\ \text{cm/s}^2 = 10^5\ \text{dyn}$

The mk(force)s system (usually called the technical system) is
widely used in engineering. The basic units of this system are the
metre, the unit of force called the kilogramme-force (kgf), and the
second. The kilogramme-force is defined as the force that imparts an
acceleration of 9.806 65 m/s$^2$ (the normal acceleration of free fall)
to a mass of 1 kg. It follows from this definition that
1 kgf = 9.806 65 N (approximately 9.81 N).
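
The relations between the newton, the dyne, and the kilogramme-force can be checked numerically. The following is only a minimal sketch: the function and constant names are hypothetical ones introduced for illustration, and the standard value 9.806 65 m/s$^2$ is used for the normal acceleration of free fall. The last function shows what the constant k of Eq. (2.12) becomes if the force is measured in kgf while the mass and acceleration are kept in SI units.

```python
# Illustrative sketch; the names below are hypothetical, not notation from the text.
G_N = 9.80665            # normal acceleration of free fall, m/s^2
DYN_PER_N = 1e3 * 1e2    # 1 kg = 10^3 g and 1 m/s^2 = 10^2 cm/s^2, so 1 N = 10^5 dyn


def newton_to_dyne(force_n: float) -> float:
    """Convert a force from newtons (SI) to dynes (cgs)."""
    return force_n * DYN_PER_N


def kgf_to_newton(force_kgf: float) -> float:
    """Convert a force from kilogramme-force to newtons."""
    return force_kgf * G_N


def constant_k(mass_kg: float, accel_m_s2: float, force_kgf: float) -> float:
    """Constant k in m*a = k*F when F is measured in kgf instead of N."""
    return mass_kg * accel_m_s2 / force_kgf


print(newton_to_dyne(1.0))          # number of dynes in one newton
print(kgf_to_newton(1.0))           # number of newtons in one kgf
print(constant_k(1.0, G_N, 1.0))    # k for the mixed choice of units
```

With this mixed choice of units the constant k is no longer unity, which is exactly why the unit of force is usually chosen so that Eq. (2.12) reduces to Eq. (2.13).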

The unit of mass in the mk(force)s system, according to Eq. (2.13),
should be taken equal to the mass of a body that receives an
acceleration of 1 m/s$^2$ when acted upon by a force of 1 kgf. Although many
names have been proposed for this unit**, none of them has been
legalized, and it is designated kgf·s$^2$/m. It is obvious that
1 kgf·s$^2$/m = 9.806 65 kg (approximately 9.81 kg).

* An alloy of platinum and iridium has a high hardness and resistance to
corrosion (i.e. is virtually not subjected to the chemical action of the
surroundings).

** See L. A. Sena. Units of Physical Quantities and Their Dimensions. 2nd
ed., Moscow, Mir Publishers (1975), pp. 9, 54.

It follows from the way a system of units is constructed that 
a change in the basic units leads to a change in the derived ones. 
If, for example, the minute is taken as the unit of time instead of 
the second, i.e. the unit of time is increased 60 times, then the unit 
of velocity will diminish 60 times, and the unit of acceleration will 
diminish 3600 times. 

The relation showing how a unit of a quantity changes when the 
basic units are changed is called the dimension of this quantity. 
The dimension of an arbitrary physical quantity is designated by 
its symbol placed in brackets. For example, [v] stands for the dimen- 
sion of velocity. Special symbols are used for the dimensions of the 
basic quantities: L for the length, M for the mass, and T for the 
time. Thus, designating the length by l, the mass by m, and the
time by t, we can write

$[l] = \mathrm{L}; \quad [m] = \mathrm{M}; \quad [t] = \mathrm{T}$

When these symbols are used, the dimension of an arbitrary
physical quantity has the form $\mathrm{L}^{\alpha}\mathrm{M}^{\beta}\mathrm{T}^{\gamma}$ ($\alpha$, $\beta$, and $\gamma$ may be either
positive or negative, and in particular may equal zero). This
notation signifies that when the unit of length is increased $n_1$ times, the
unit of a given quantity grows $n_1^{\alpha}$ times (accordingly, the number that
expresses the value of the quantity in these units diminishes $n_1^{\alpha}$
times). When the unit of mass is increased $n_2$ times, the unit of the
given quantity grows $n_2^{\beta}$ times. Finally, when the unit of time is
increased $n_3$ times, the unit of the given quantity grows $n_3^{\gamma}$ times.
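
The rule just stated is easy to try out on the example of the minute taken as the unit of time. The sketch below is an illustration only; the function `unit_scale` and the tuples of exponents are names assumed here, and exact fractions are used so that the factors 1/60 and 1/3600 appear without rounding.

```python
from fractions import Fraction


def unit_scale(dim, n_length=1, n_mass=1, n_time=1):
    """Factor by which a derived unit grows when the basic units grow n times.

    dim is the tuple (alpha, beta, gamma) of the exponents of L, M and T.
    """
    alpha, beta, gamma = dim
    return (Fraction(n_length) ** alpha
            * Fraction(n_mass) ** beta
            * Fraction(n_time) ** gamma)


VELOCITY = (1, 0, -1)       # LT^-1
ACCELERATION = (1, 0, -2)   # LT^-2

# The second is replaced by the minute: the unit of time grows 60 times.
print(unit_scale(VELOCITY, n_time=60))       # 1/60
print(unit_scale(ACCELERATION, n_time=60))   # 1/3600
```

A factor smaller than unity means that the derived unit diminishes, in agreement with the 60-fold and 3600-fold reductions mentioned above.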

Since physical laws cannot depend on the selection of the units 
for the quantities figuring in them, the dimensions of both sides of 
the equations expressing these laws must be the same. This condi- 
tion can be used, first, for verifying the correctness of physical 
relations obtained, and, second, for establishing the dimensions 
of physical quantities. For example, speed is determined by the
formula $v = \Delta s/\Delta t$. The dimension of $\Delta s$ is L, and that of $\Delta t$ is T.
The dimension of the right-hand side of the above formula is
$[\Delta s]/[\Delta t] = \mathrm{L}/\mathrm{T} = \mathrm{LT}^{-1}$. The dimension of the left-hand side must
be the same. Hence,

$[v] = \mathrm{LT}^{-1}$ (2.14)

This relation is called a dimension formula, and its right-hand side is
called the dimension of the relevant quantity (in our case, of speed).

The relation $a = \Delta v/\Delta t$ permits us to establish the dimension of
acceleration:

$[a] = \frac{[\Delta v]}{[\Delta t]} = \frac{\mathrm{LT}^{-1}}{\mathrm{T}} = \mathrm{LT}^{-2}$ (2.15)

The dimension of force is

$[F] = [m][a] = \mathrm{MLT}^{-2}$ (2.16)

The dimensions of all other quantities are established in a similar 
way. 
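
The way dimensions are built up from one another can be imitated by simple bookkeeping of exponents. The following is a minimal sketch, not a full units library; the class `Dim` and the variable names are assumptions made for the illustration. Multiplying quantities adds the exponents of L, M, and T, and dividing subtracts them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Dim:
    """Exponents of L, M and T in the dimension of a quantity."""
    L: int = 0
    M: int = 0
    T: int = 0

    def __mul__(self, other: "Dim") -> "Dim":
        # A product of quantities adds the exponents.
        return Dim(self.L + other.L, self.M + other.M, self.T + other.T)

    def __truediv__(self, other: "Dim") -> "Dim":
        # A quotient of quantities subtracts the exponents.
        return Dim(self.L - other.L, self.M - other.M, self.T - other.T)


LENGTH, MASS, TIME = Dim(L=1), Dim(M=1), Dim(T=1)

velocity = LENGTH / TIME          # Eq. (2.14)
acceleration = velocity / TIME    # Eq. (2.15)
force = MASS * acceleration       # Eq. (2.16)

print(velocity, acceleration, force)
```

Comparing two `Dim` values for equality is then a direct check that both sides of a formula have the same dimension.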


2.6. Newton’s Third Law 

Any action of bodies on one another has the nature of mutual
interaction: if body 1 acts on body 2 with the force $\mathbf{F}_{21}$, then body 2,
in turn, acts on body 1 with the force $\mathbf{F}_{12}$.

Newton’s third law states that the forces exerted by interacting
bodies on each other are equal in magnitude and opposite in direction.

Using the above symbols for such forces, the third
law can be expressed in the form of the equation

$\mathbf{F}_{12} = -\mathbf{F}_{21}$ (2.17)

It follows from Newton’s third law that forces 
appear in pairs: for any force applied to a body 
there is another force equal in magnitude and 
opposite in direction applied to the second body 
interacting with the first one. 

Newton’s third law is not always correct. It 
is observed quite strictly in contact interactions 
(i.e. interactions observed upon the direct con- 
tact of bodies), and also when bodies at rest that 
are a certain distance apart interact with each 
other. 

As an example of the violation of Newton’s third law, we can take
a system of two charged particles $e_1$ and $e_2$ moving at the given
moment as shown in Fig. 2.1. It is proved in electrodynamics that apart
from the force of electrostatic interaction $\mathbf{F}_{12}$ obeying the third
law, the magnetic force $\mathbf{F}_1$ will also be exerted on the first
particle. Only the force $\mathbf{F}_{21}$ equal to $-\mathbf{F}_{12}$ acts on
the second particle. The magnitude of the magnetic force acting on
the second particle for the case shown in the figure equals zero. It
must be noted that for speeds of particles that are much smaller
than the speed of light in a vacuum (when $v_1 \ll c$ and $v_2 \ll c$) the
force $\mathbf{F}_1$ is negligibly small in comparison with the force $\mathbf{F}_{12}$, so
that Newton’s third law is virtually correct in this case too.

Now let us consider a system of two electrically neutral particles
$m_1$ and $m_2$ at the distance r from each other. Owing to universal
gravitation, these particles attract each other with the force

$F = G \frac{m_1 m_2}{r^2}$ (2.18)


In this case, the particles interact via a gravitational field. Say,
the first particle sets up in the space surrounding it a field which
manifests itself in that the particle $m_2$ placed at a point of this
field experiences a force of attraction to the first particle.
Similarly, the second particle sets up a field which manifests
itself in its action on the first particle. Experiments show
that the changes in the field due, for instance, to a change
in the position of the particle producing it propagate in space not
instantaneously, but at a finite, though very high, speed equal to
the speed of light in a vacuum c.

Let us assume that the particles $m_1$ and $m_2$ are initially at rest
at positions 1 and 2 (Fig. 2.2). The forces of interaction $\mathbf{F}_{12}$ and $\mathbf{F}_{21}$
are equal in magnitude and opposite in direction. Now assume that
the particle $m_1$ moves very rapidly (with a speed almost equal to c)
to position 1'. At this point, the particle $m_1$ will experience the force
$\mathbf{F}'_{12}$ smaller in magnitude ($r' > r$) and directed differently than $\mathbf{F}_{12}$
(we remind our reader that the field of the particle $m_2$ remains
constant). The force $\mathbf{F}_{21}$ will continue to act on the second particle until
the disturbance of the field due to the displacement of $m_1$ reaches
point 2. Consequently, Newton’s third law was violated while the
particle $m_1$ was in motion and for some time after it stopped at point 1'.

If the particle $m_1$ moved from point 1 to point 1' with the speed v
much smaller than c ($v \ll c$), or the speed of propagation of field
disturbances were infinitely great, then the instantaneous values of
the field at point 2 would correspond to the positions of the particle
$m_1$ at the same moment of time, and, consequently, no violations
of the third law would be observed.

Newtonian mechanics in general holds only for speeds that are
much smaller than the speed of light (when $v \ll c$). Therefore, within
the confines of this mechanics, the speed of propagation of field
disturbances is considered to be infinite, and Newton’s third law is
always obeyed.
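
To get a feeling for how small the effect of the finite speed of propagation is at ordinary speeds, we can estimate the delay r/c and the displacement of a particle during this delay. The sketch below is a rough illustration under stated assumptions: the function names are hypothetical, and the criterion used (the displacement during the delay relative to the distance r, which equals v/c) is only a crude measure chosen here, not a result taken from the text.

```python
C = 299_792_458.0  # speed of light in a vacuum, m/s


def propagation_delay(distance_m: float) -> float:
    """Time needed for a field disturbance to cover the distance r."""
    return distance_m / C


def relative_shift_during_delay(speed_m_s: float) -> float:
    """Displacement v*(r/c) of the source during the delay, divided by r."""
    return speed_m_s / C


# Assumed illustrative values: the Earth-Sun distance and the Earth's orbital speed.
r = 1.496e11   # m
v = 3.0e4      # m/s

print(propagation_delay(r))              # delay in seconds
print(relative_shift_during_delay(v))    # of the order of 1e-4
```

For speeds much smaller than c the second number is very small, which corresponds to treating the propagation of disturbances as instantaneous.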



2.7. Galileo’s Relativity Principle 


Let us consider two reference frames moving relative to each other
with the constant velocity $\mathbf{v}_0$. One of these frames, designated in
Fig. 2.3 by the letter K, will conditionally be considered fixed.
Therefore, the second frame K' will move uniformly in a straight line.
Let us choose the coordinate axes x, y, z of frame K and the axes x',
y', z' of frame K' so that the axes x and x' coincide, while the axes y
and y', and also z and z' are parallel to each other.

Let us find the relation between the coordinates x, y, z of a point P
in frame K and the coordinates x', y', z' of the same point in frame K'.
If we begin to count the time from the moment when the origins of
the coordinates of the two frames coincided, then as follows from
Fig. 2.3, we have $x = x' + v_0 t$. In addition, it is quite obvious that
y = y' and z = z'. Adding to these relations the assumption
adopted in classical mechanics that the time flows the same
in both frames, i.e. that t = t', we get a group of four equations:

$x = x' + v_0 t', \quad y = y', \quad z = z', \quad t = t'$ (2.19)

They are called Galilean transformations.

The first and last of Eqs. (2.19) are correct only at values of $v_0$
that are small in comparison with the speed of light in a vacuum c
(i.e. $v_0 \ll c$). At values of $v_0$ comparable with c, the Galilean
transformations must be replaced with the more general Lorentz
transformations (see Sec. 8.2). Equations (2.19) are assumed to be accurate
within the confines of Newtonian mechanics.
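
Equations (2.19) and their inverse can be written as two short functions. This is only a sketch with hypothetical function names; the frames are assumed to be oriented as described above, with the relative velocity $v_0$ directed along the common x-axis.

```python
def to_frame_K(x_prime, y_prime, z_prime, t_prime, v0):
    """Galilean transformation (2.19): coordinates in K from those in K'."""
    return (x_prime + v0 * t_prime, y_prime, z_prime, t_prime)


def to_frame_K_prime(x, y, z, t, v0):
    """Inverse transformation: coordinates in K' from those in K."""
    return (x - v0 * t, y, z, t)


# A point at rest in K' (x' = 1) seen from K after 4 units of time, v0 = 5.
event_in_K = to_frame_K(1, 2, 3, 4, v0=5)
print(event_in_K)                            # (21, 2, 3, 4)
print(to_frame_K_prime(*event_in_K, 5))      # (1, 2, 3, 4)
```

Applying the two functions one after the other returns the original coordinates, and the time is the same in both frames, as assumed in classical mechanics.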

Differentiating Eqs. (2.19) with respect to time, we find the
relation between the velocities of point P relative to the reference
frames K and K':

$\dot{x} = \dot{x}' + v_0$ or $v_x = v'_x + v_0$
$\dot{y} = \dot{y}'$ or $v_y = v'_y$
$\dot{z} = \dot{z}'$ or $v_z = v'_z$ (2.20)


The three scalar relations (2.20) are equivalent to the following
relation between the velocity vector v relative to frame K and the
velocity vector v' relative to frame K':

$\mathbf{v} = \mathbf{v}' + \mathbf{v}_0$ (2.21)

To convince ourselves of the truth of this equation, it is sufficient to
project vector equation (2.21) onto the axes x, y, z. As a result, we
get equations (2.20).

Equations (2.20) and (2.21) give the rule of velocity addition in
classical mechanics. It must be borne in mind that Eq. (2.21), like 
any other vector equation, remains correct upon an arbitrary selec- 
tion of the mutual directions of the coordinate axes of the frames K 
and K' . Equations (2.20), however, are obeyed only when the axes 
are chosen as shown in Fig. 2.3. 
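
The vector form (2.21) of the rule of velocity addition can be illustrated componentwise. The sketch below uses a hypothetical helper `add_velocities` and assumed numerical values (a passenger walking in a train car); it is an illustration rather than anything prescribed by the text.

```python
def add_velocities(v_prime, v0):
    """Eq. (2.21): velocity in K from the velocity in K' and the frame velocity v0."""
    return tuple(a + b for a, b in zip(v_prime, v0))


train = (20.0, 0.0, 0.0)       # velocity of frame K' (the car) relative to K, m/s

along = (1.5, 0.0, 0.0)        # passenger walks along the car
across = (0.0, 1.5, 0.0)       # passenger walks across the car

print(add_velocities(along, train))    # (21.5, 0.0, 0.0)
print(add_velocities(across, train))   # (20.0, 1.5, 0.0)
```

The same function works for any orientation of the axes, whereas the scalar relations (2.20) correspond to the special case in which the frame velocity has only an x-component.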

We noted in Sec. 2.2 that any reference frame moving relative to 
an inertial frame with a constant velocity will also be inertial. 
Now we are in a position to prove this statement. To do this, let us 
differentiate Eq. (2.21) with respect to time. Taking into account 
that $\mathbf{v}_0$ is constant, we get

$\dot{\mathbf{v}} = \dot{\mathbf{v}}'$ or $\mathbf{a} = \mathbf{a}'$ (2.22)

Hence it follows that the acceleration of a body in all reference 
frames moving uniformly in a straight line relative to one another is 
the same. Therefore, if one of these frames is inertial (this signifies 
that in the absence of forces a = 0), then the others will also be 
inertial (a' also equals zero). 
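
The equality of the accelerations in the two frames can also be seen numerically. In the sketch below, which is only an illustration with assumed names and values, the motion in frame K' is taken as $x'(t) = 3t^2$, the coordinate in frame K is found from Eqs. (2.19), and the acceleration is estimated in both frames by a second difference.

```python
def second_difference(x, t, h=1.0):
    """Estimate of the second derivative of x(t); exact for a quadratic x(t)."""
    return (x(t + h) - 2 * x(t) + x(t - h)) / h**2


V0 = 5.0  # assumed constant velocity of frame K' relative to frame K


def x_prime(t):
    """Coordinate of the body in frame K' (uniformly accelerated motion)."""
    return 3.0 * t**2


def x(t):
    """Coordinate of the same body in frame K according to Eqs. (2.19)."""
    return x_prime(t) + V0 * t


a_prime = second_difference(x_prime, 2.0)
a = second_difference(x, 2.0)
print(a_prime, a)   # 6.0 6.0
```

The term $v_0 t$ is linear in time and therefore drops out of the second difference, which is the numerical counterpart of Eq. (2.22).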

The fundamental equation of mechanics (2.11) is characterized
by the fact that, of the kinematic quantities, it contains only the
acceleration. It does not contain the velocity. As we have established
above, however, the acceleration of a body in two arbitrarily selected
inertial reference frames K and K' is the same. Hence it follows according
to Newton’s second law that the forces acting on a body in frames K
and K' will also be the same. Consequently, the equations of dynamics
do not change upon transition from one inertial reference frame to
another one, i.e. they are said to be invariant with respect to the
transformation of the coordinates corresponding to the transition from
one inertial reference frame to another. From the viewpoint of
mechanics, all inertial reference frames are absolutely equivalent,
and none of them can be preferred to others. In practice, this
manifests itself in that we cannot establish by any mechanical experiments
conducted within the limits of a given reference frame whether it
is in a state of rest or in uniform straight-line motion. For example,
if we are in a car of a train running uniformly in a straight line
without jolts, we cannot determine whether the car is moving or at
rest without looking out of a window. The free fall of bodies, the
motion of objects that we throw, and all other mechanical processes
in this case will occur in the same way as if the car were standing.

These circumstances were already established by Galileo Galilei 
(1564-1642). The statement that all mechanical phenomena in differ- 
ent inertial reference frames proceed identically, owing to which 
no mechanical experiments allow us to determine whether the given 
reference frame is at rest or is moving uniformly in a straight line, 
is called Galileo’s relativity principle.


2.8. Forces


Four kinds of interactions are distinguished in modern physics: 
(1) gravitational (or interaction due to universal gravitation), (2) 
electromagnetic (achieved via electric and magnetic fields), (3) strong 
or nuclear (ensuring the binding of particles in an atomic nucle- 
us), and (4) weak interaction (responsible for many processes of 
elementary particle decay). 

Within the confines of classical mechanics, we have to do with 
gravitational and electromagnetic forces, and also with elastic and 
friction forces. The latter two kinds of forces are determined by the 
nature of the interaction between the molecules of a substance. The 
forces of interaction between molecules have an electromagnetic 
origin. Consequently, elastic and friction forces are electromagnetic 
in their nature. 

Gravitational and electromagnetic forces are fundamental — they 
cannot be reduced to other simpler forces. Elastic and friction forces, 
on the other hand, are not fundamental. 

The laws of the fundamental forces are exceedingly simple. The 
magnitude of the gravitational force is determined by Eq. (2.18). 
The magnitude of the force with which two point charges $q_1$ and $q_2$
at rest interact is determined by Coulomb’s law:

$F = k \frac{q_1 q_2}{r^2}$ (2.23)


(k is a constant of proportionality depending on the units chosen 
for the quantities in the formula). 

If the charges are moving, then magnetic forces act on them in 
addition to the force defined by Eq. (2.23). The magnetic force acting 
on a point charge q moving with the velocity v in a magnetic field of 
induction B is determined by the formula 


$\mathbf{F} = k' q\,[\mathbf{v}\mathbf{B}]$ (2.24)

(k' is a constant of proportionality).

Equations (2.18), (2.23), and (2.24) are accurate ones. For elastic
and friction forces we can obtain only approximate empirical
formulas that are considered in the following sections.
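
The three laws of the fundamental forces are simple enough to be written directly as functions. The sketch below is an illustration under assumptions that are not part of the text: SI units are used, so that k is taken approximately equal to 8.9875 × 10^9 N·m²/C² and k' = 1, the numerical constants are rounded reference values, and all names are hypothetical. The vector product [vB] is evaluated by components.

```python
G = 6.674e-11          # gravitational constant, N m^2 / kg^2 (rounded)
K_COULOMB = 8.9875e9   # k of Eq. (2.23) in SI units, N m^2 / C^2 (rounded)
E_CHARGE = 1.602e-19   # elementary charge, C (rounded)
M_ELECTRON = 9.109e-31 # kg (rounded)
M_PROTON = 1.6726e-27  # kg (rounded)


def gravitational_force(m1, m2, r):
    """Magnitude of the force of Eq. (2.18)."""
    return G * m1 * m2 / r**2


def coulomb_force(q1, q2, r):
    """Magnitude of the force of Eq. (2.23) for point charges at rest."""
    return K_COULOMB * abs(q1 * q2) / r**2


def cross(a, b):
    """Vector product [ab] of two 3-component vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def magnetic_force(q, v, B, k_prime=1.0):
    """Force of Eq. (2.24); k' = 1 is the SI choice assumed here."""
    return tuple(k_prime * q * c for c in cross(v, B))


r = 5.3e-11  # assumed electron-proton separation, m
ratio = coulomb_force(E_CHARGE, E_CHARGE, r) / gravitational_force(M_ELECTRON, M_PROTON, r)
print(ratio)                                         # of the order of 1e39
print(magnetic_force(1.0, (1, 0, 0), (0, 1, 0)))     # directed along the z-axis
```

Since both forces fall off as $1/r^2$, the ratio of the electrostatic to the gravitational force between an electron and a proton does not depend on the distance chosen.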


2.9. Elastic Forces 

Any real body becomes deformed, i.e. changes its dimensions and 
shape, under the action of forces applied to it. If the body regains 
its initial dimensions and shape after the action of the forces stops, 
the deformation or strain is called elastic. Elastic deformations are
observed when the force producing the deformation does not exceed
a definite limit, called the elastic limit, for each concrete body. 

Fig. 2.4

Let us take a spring of length $l_0$ in its undeformed state and apply
the forces $\mathbf{F}_1$ and $\mathbf{F}_2$ to its ends that are equal in value and opposite
in direction (Fig. 2.4). Under the action of these forces, the spring
will stretch over a certain distance $\Delta l$, after which equilibrium will
set in. In the state of equilibrium, the external forces $\mathbf{F}_1$ and $\mathbf{F}_2$
will be balanced by the elastic forces set up in the spring as a result
of its deformation. Experiments show that with small deformations,
the stretching of the spring $\Delta l$ is proportional to the stretching force:
$\Delta l \propto F$ (here $F = F_1 = F_2$). Accordingly, the elastic force is
proportional to the elongation of the spring:

$F = k\,\Delta l$ (2.25)

The constant of proportionality k is called the spring constant. 

The statement that the elastic force and deformation are propor- 
tional to each other is called Hooke’s law. 

Fig. 2.5    Fig. 2.6

Elastic strains are set up throughout the entire spring. Any part
of the spring acts on another part with a force determined by Eq.
(2.25). Therefore, if we cut the spring in half, an identical elastic
force will appear in each half with the elongation being half the
original value. Hence, we conclude that with a given material of
the spring and a given coil size the magnitude of the elastic force
is determined not by the absolute elongation of the spring $\Delta l$, but by
its relative elongation $\Delta l/l_0$.
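
The argument about the spring cut in half can be followed with numbers. The sketch below is illustrative, with hypothetical function names and assumed values: since the same force produces half the elongation in half the spring, the spring constant of a piece is inversely proportional to its length, while the product of the constant and the undeformed length is the same for the whole spring and for any piece of it.

```python
def piece_spring_constant(k_whole: float, fraction: float) -> float:
    """Spring constant of a piece whose length is `fraction` of the whole spring."""
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must lie in (0, 1]")
    return k_whole / fraction


def elastic_force(k: float, l0: float, elongation: float) -> float:
    """Elastic force written through the relative elongation: F = (k*l0)*(dl/l0)."""
    return (k * l0) * (elongation / l0)


k_whole, l0, dl = 100.0, 0.2, 0.01                 # N/m, m, m (assumed values)
k_half = piece_spring_constant(k_whole, 0.5)       # constant of one half

F_whole = elastic_force(k_whole, l0, dl)
F_half = elastic_force(k_half, l0 / 2, dl / 2)     # half the length, half the elongation
print(k_half, F_whole, F_half)
```

The two forces coincide because the relative elongation is the same in both cases.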

Elastic strains, but of the opposite sign, are also set up when
a spring is compressed. Let us generalize Eq. (2.25) as follows. We
shall rigidly fix one end of a spring (Fig. 2.5) and shall consider the
elongation of the spring as the coordinate x of its other end measured
from its position corresponding to the undeformed spring*. In
addition, by F we shall understand the projection of the elastic
force $\mathbf{F}_{el}$ onto the x-axis. We can thus write that

$F = -kx$ (2.26)

(inspection of Fig. 2.5 shows that the projection of the elastic force 
onto the x-axis and the coordinate x always have opposite signs). 

Homogeneous bars behave in tension or uniaxial compression like
a spring. If we apply the forces $\mathbf{F}_1$ and $\mathbf{F}_2$ ($F_1 = F_2 = F$) to the ends
of a bar, these forces being directed along its axis and acting
uniformly over the entire cross section, then the length of the bar $l_0$
will receive either a positive (in stretching) or a negative (in
compression) increment** $\Delta l$ (Fig. 2.6). It is quite natural to take the
relative change in the length of the bar as the quantity characterizing
its deformation:

$\varepsilon = \frac{\Delta l}{l_0}$ (2.27)



Experiments show that for bars of a given material the relative
elongation or strain in elastic deformation is proportional to the
force per unit cross-sectional area of the bar:

$\varepsilon = \alpha \frac{F}{S}$ (2.28)

The constant of proportionality $\alpha$ is called the compliance
coefficient.

The quantity equal to the ratio between the force and the area of
the surface it is acting on is called the stress. Owing to the
interaction of the parts of a body with one another, the stress is transmitted
to all points of the body, so that the entire volume of the body, for
example a bar, will be in a stressed state. If the force is directed along
a normal to the surface, the stress is called normal. If it is directed
along a tangent to the surface it is acting on, the stress is called
tangential (shear). The normal stress is designated by the symbol $\sigma$,
the tangential or shear stress by the symbol $\tau$.

The ratio F/S in Eq. (2.28) is the normal stress $\sigma$. Hence, this
equation can be written in the form

$\varepsilon = \alpha\sigma$ (2.29)

* In Fig. 2.5b, the distance over which the end of the spring moved is
designated -x. The reason is that the distance is a positive quantity,
while the coordinate x in this case, however, is a negative one.

** A change in the length of the bar is attended by a corresponding change
in its cross-sectional dimensions.

In addition to the compliance coefficient $\alpha$, the elastic properties of
a material are also characterized by its reciprocal $E = 1/\alpha$, which
is called the modulus of elasticity or Young’s modulus. It is
measured in pascals (the pascal is the unit of pressure in the SI system,
1 Pa = 1 N/m$^2$).

Using 1/E instead of $\alpha$ in Eq. (2.29), we get

$\varepsilon = \frac{\sigma}{E}$ (2.30)

from which it follows that Young’s modulus equals such a normal
stress at which the relative elongation or strain would equal unity
(i.e. the increment of the length $\Delta l$ would be equal to the original
length $l_0$) if such great elastic deformations were possible (actually
a bar will fail at considerably smaller stresses, and the elastic
limit is reached still earlier).

Solving Eq. (2.28) with respect to F and using $\Delta l/l_0$ instead of
$\varepsilon$ and 1/E instead of $\alpha$, we get

$F = \frac{ES}{l_0}\,\Delta l = k\,\Delta l$ (2.31)

where k is a constant quantity for a given bar. Equation (2.31)
expresses Hooke’s law for a bar [compare with Eq. (2.26)]. Do not
forget that this law is obeyed only until the elastic limit is reached.
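
Equation (2.31) allows us to estimate how much a bar stretches under a given load. The sketch below is an illustration only: the function names are hypothetical, and the value of Young’s modulus used for steel (about 2 × 10^11 Pa) and the dimensions of the bar are assumed typical figures, not data from the text.

```python
E_STEEL = 2.0e11   # Pa, assumed typical value of Young's modulus for steel


def bar_spring_constant(E: float, S: float, l0: float) -> float:
    """Constant k = E*S/l0 of Eq. (2.31) for a bar of cross section S and length l0."""
    return E * S / l0


def bar_elongation(F: float, E: float, S: float, l0: float) -> float:
    """Elongation of the bar under the axial force F within the elastic limit."""
    return F / bar_spring_constant(E, S, l0)


S = 1.0e-4    # cross-sectional area, m^2 (1 cm^2)
l0 = 2.0      # original length, m
F = 1.0e4     # stretching force, N

dl = bar_elongation(F, E_STEEL, S, l0)
strain = dl / l0              # Eq. (2.27)
stress = F / S                # normal stress
print(dl, strain, stress)     # about 1e-3 m, 5e-4 and 1e8 Pa
```

The strain obtained in this way agrees with Eq. (2.30), since it equals the stress divided by E.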

In conclusion, let us briefly consider shear strain. Let us take
a homogeneous body having the shape of a rectangular parallelepiped
and apply to its opposite faces the forces $\mathbf{F}_1$ and $\mathbf{F}_2$ ($F_1 = F_2 = F$)
directed parallel to these faces (Fig. 2.7). If the action of the
forces is uniformly distributed over the entire surface of the
corresponding face, then in any cross section parallel to these faces
the tangential (shear) stress

$\tau = \frac{F}{S}$ (2.32)

will appear (S is the area of a face). The action of the stresses will
cause the body to deform so that one face will move relative to
another over the distance a. If we mentally divide the body into
elementary layers parallel to the faces we are considering, then each
layer will be shifted with respect to its neighbours. For this reason,
such deformation is called shear.

In shear, any straight line originally perpendicular to the layers
will turn through the angle $\varphi$. Shear is characterized by the quantity

$\gamma = \frac{a}{b} = \tan\varphi$ (2.33)

called the relative shear (what a and b are is clear from Fig. 2.7).
Upon elastic deformations, the angle $\varphi$ is very small. We can
therefore assume that $\tan\varphi \approx \varphi$. Consequently, the relative shear $\gamma$
equals the angle of shear $\varphi$.

Experiments show that the relative shear is proportional to the
tangential stress:

$\gamma = \frac{1}{G}\,\tau$ (2.34)

The coefficient G depends only on the properties of a material and
is called the shear modulus. It equals such a tangential (shear)
stress at which the angle of shear would be 45 degrees ($\tan\varphi = 1$) if the
elastic limit were not exceeded at such great deformations. The shear
modulus G, like Young’s modulus E, is measured in pascals (Pa).
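
The smallness of the angle of shear in elastic deformations can be checked on a numerical example. The sketch below is illustrative; the function name is hypothetical, and the value of the shear modulus (about 8 × 10^10 Pa, typical of steel) and the load are assumptions made for the example.

```python
import math

G_STEEL = 8.0e10   # Pa, assumed typical value of the shear modulus for steel


def relative_shear(F: float, S: float, G: float) -> float:
    """Relative shear from Eqs. (2.32) and (2.34): gamma = (F/S)/G."""
    tau = F / S          # tangential (shear) stress
    return tau / G


gamma = relative_shear(F=8.0e3, S=1.0e-4, G=G_STEEL)   # about 1e-3
phi = math.atan(gamma)                                  # angle of shear, rad
print(gamma, phi, gamma - phi)
```

The difference between $\gamma$ and $\varphi$ is many orders of magnitude smaller than either of them, so that replacing $\tan\varphi$ by $\varphi$ is well justified here.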


2.10. Friction Forces 

Forces of friction appear when contacting bodies or their parts 
move relative to one another. The friction occurring in the relative 
movement of two contacting bodies is called external; the friction 
between parts of the same continuous body (for example, a fluid) is 
called internal. 

The force of friction appearing when a solid body moves relative 
to a fluid (liquid or gas) medium must be related to the category of 
internal friction forces because in this case the layers of the fluid in 
direct contact with the body are carried along with it at the body’s 
velocity. The motion of the body is influenced by the friction between 
these layers of the fluid and other of its layers that are external rela- 
tive to them. 

Friction between the surfaces of two solids in the absence of any 
intermediate layer, for instance, a lubricant between them, is called 
dry. Friction between a solid and a fluid, and also between the 
layers of a fluid, is called viscous (or liquid). 

Two kinds of dry friction are distinguished: sliding and rolling. 

Forces of friction are directed along a tangent to the surfaces (or 
layers) in contact so that they resist the relative displacement of 
these surfaces (layers). If, for example, two layers of a liquid slide 
over each other with different velocities, then the force applied to the 
faster layer is directed oppositely to the direction of motion, while 
the force acting on the slower layer is directed along its motion. 

Dry Friction. In dry friction, a force of friction appears not only
when one surface slides over another one, but also when attempts
are made to set up such sliding motion. In the latter case, we have
to do with the force of static friction. Let us consider two contacting
bodies 1 and 2 of which the latter is fixed in place (Fig. 2.8). Body 1
is pressed against body 2 by the force $\mathbf{F}_n$ directed along a normal to
the surface of contact of the bodies. It is called the normal force
and may be due to the pressure of the body’s weight, or to other
reasons. Let us try to move body 1 by acting on it with an external
force F. We shall find that for every concrete pair of bodies and
every value of the normal force there is a definite minimum value $F_0$
of the force F at which body 1 first begins to move. At values of the
external force F within the limits from 0 to $F_0$, the body remains at
rest. According to Newton’s second law, this is possible if the force
F is balanced by a force equalling it in magnitude and opposite
in direction, which is exactly the force of static friction $\mathbf{F}_{fr}$ (see
Fig. 2.8). It automatically* acquires a value equal to that of the
external force F (provided that the latter does not exceed $F_0$).
The quantity $F_0$ is the maximum possible value of the force
of static friction.

Fig. 2.8

It must be noted that in accordance with Newton’s third law body 2
must also experience the force of static friction $\mathbf{F}'_{fr}$ (it is shown
by a dashed arrow in Fig. 2.8) equal in magnitude to the force $\mathbf{F}_{fr}$,
but directed oppositely.

If the external force F exceeds $F_0$ in magnitude, the body begins
to slide. Its acceleration is determined by the resultant of two forces:
the external one F and the force of sliding friction $\mathbf{F}_{fr}$ whose
magnitude depends to a certain extent on the sliding speed. The
nature of this relation is determined by the nature and state of the
contacting surfaces. The kind of the speed dependence of the force
of friction shown in Fig. 2.9 is encountered most frequently. The
graph shows both static and sliding friction. The force of static
friction, as we have already noted, may range from 0 to $F_0$, which is
shown by the vertical line in the graph. In accordance with Fig. 2.9,
the force of sliding friction first diminishes somewhat with increasing
speed, and then begins to grow.

Fig. 2.9

With special processing of contacting surfaces, the force of
sliding friction may be virtually independent of the speed. In this
case, the curved portion of the graph in Fig. 2.9 will transform into
a horizontal line beginning at the point $F_0$.

* This occurs in the same way as a spring acted upon by a stretching force
automatically acquires an elongation such that the elastic force balances the
external one.

The laws of dry friction consist in the following: the maximum
force of static friction, and also the force of sliding friction do not
depend on the area of contact between bodies and are approximately
proportional to the magnitude of the normal force pressing the
contacting surfaces together:

$F_{fr} = f F_n$ (2.35)

The dimensionless proportionality constant f is called the
coefficient of friction (static or sliding friction, as the case may be). It
depends on the nature and state of the contacting surfaces,
particularly on their roughness. The coefficient of sliding friction is
a function of speed.
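
The behaviour of the force of dry friction described above can be put into a short function. The sketch below is a simplified illustration under stated assumptions: the body is initially at rest, the external force acts along the surface, the coefficient of sliding friction is taken as independent of the speed, and the function and parameter names are hypothetical.

```python
def friction_force(external_force: float, normal_force: float,
                   f_static: float, f_sliding: float) -> float:
    """Projection of the friction force onto the direction chosen as positive.

    While |F| does not exceed F0 = f_static * F_n, static friction balances F.
    Otherwise the body slides and the friction force is f_sliding * F_n,
    directed against the motion, as in Eq. (2.35).
    """
    f0 = f_static * normal_force              # maximum force of static friction
    if abs(external_force) <= f0:
        return -external_force                # body stays at rest
    direction = 1.0 if external_force > 0 else -1.0
    return -direction * f_sliding * normal_force


F_n = 10.0   # normal force, N (assumed)
print(friction_force(3.0, F_n, f_static=0.5, f_sliding=0.4))   # -3.0, body at rest
print(friction_force(6.0, F_n, f_static=0.5, f_sliding=0.4))   # -4.0, body slides
```

In the first case the resultant force is zero, and in the second it equals 2 N in the direction of the external force, which determines the acceleration of the body.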

Friction forces play a very great part in nature. Friction is often
a great help to us in our everyday life. Let us remember the tremendous
difficulties encountered by pedestrians and vehicles on ice-covered
pavements and roads, when the friction between the pavement
surface and the pedestrians’ soles or the wheels of the vehicles
considerably diminishes. If there were no friction forces, our
furniture would have to be fastened to the floor like on a ship on a rolling
sea because upon the most minute deviation of the floor from a
horizontal position it would slide in the direction of the slope. Our reader
can give numerous similar examples of how helpful friction is.

The part played by friction is often extremely negative, and meas- 
ures have to be taken to reduce it as much as possible. This relates, 
for example, to the friction in bearings or between the hub of a wheel 
and an axle. 

The most radical way of reducing forces of friction is to replace 
sliding friction with rolling friction. The latter appears, for exam- 
ple, between a cylindrical or spherical body rolling over a flat or 
curved surface. Rolling friction formally obeys the same laws as 
sliding friction, but the coefficient of friction in this case is much 
lower. 

Viscous Friction and Resistance of the Medium. Unlike dry fric- 
tion, viscous (internal) friction is characterized by the force of 
viscous friction vanishing together with the velocity. Therefore, 
no matter how small an external force is, it can impart a relative 
velocity to the layers of a viscous medium. The laws which the forces 
of friction between the layers of a medium obey will be consid- 
ered in the chapter devoted to fluid mechanics (Chap. 9). 

In this section, we shall limit ourselves to a treatment of the
friction forces between a solid and a viscous (fluid) medium. It must
be borne in mind that apart from the forces of friction proper, the
motion of bodies in a fluid is attended by the so-called forces of
resistance of the medium, which can be much greater than the forces
of friction. We cannot consider the causes of these forces in detail
here. We shall only treat the laws obeyed jointly by forces of friction
and resistance of the medium. We shall conditionally call the total
force the force of friction. The speed dependence of this force is
shown in Fig. 2.10.

At low velocities, the force grows linearly with the velocity:

$$\mathbf{F}_{fr} = -k_1 \mathbf{v} \tag{2.36}$$

(the minus sign signifies that the force is directed oppositely to the
velocity). The value of the coefficient $k_1$ depends on the shape and
dimensions of a body, the state of its surface, and on the property of
the fluid called its viscosity. For example, this coefficient is much
higher for glycerine than for water.

At high velocities, the linear law transforms into a quadratic one,
i.e. the force begins to grow in proportion to the square of the
velocity:

$$\mathbf{F}_{fr} = -k_2 v^2 \mathbf{e}_v \tag{2.37}$$

($\mathbf{e}_v$ is the unit vector of the velocity). The value of the
coefficient $k_2$ depends on the shape and dimensions of a body.

Fig. 2.10

The magnitude of the velocity at which the law (2.36) transforms 
into (2.37) depends on the shape and dimensions of a body, and also 
on the viscosity and density of the fluid. 
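
As a rough numerical illustration of Eqs. (2.36) and (2.37), the sketch below evaluates both laws for motion along a straight line. The coefficient values `k1` and `k2` are made-up numbers, not data from the text. The speed `k1 / k2`, at which the two expressions happen to give the same magnitude, is used only as a convenient scale; the text does not define the transition between the two laws in this way.

```python
def linear_drag(v, k1):
    """Eq. (2.36) in one dimension: F = -k1 * v."""
    return -k1 * v


def quadratic_drag(v, k2):
    """Eq. (2.37) in one dimension: F = -k2 * v**2 * (sign of v)."""
    return -k2 * v * abs(v)


k1 = 0.5  # hypothetical coefficient, N*s/m
k2 = 0.1  # hypothetical coefficient, N*s^2/m^2
v_equal = k1 / k2  # speed at which both formulas give the same magnitude

for v in (0.1 * v_equal, v_equal, 10.0 * v_equal):
    print(f"v = {v:6.2f} m/s  linear: {linear_drag(v, k1):8.3f} N  "
          f"quadratic: {quadratic_drag(v, k2):8.3f} N")
```

Well below `v_equal` the linear expression gives the larger force, and well above it the quadratic one does, which mirrors the qualitative shape of the curve in Fig. 2.10.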


2.11. Force of Gravity and Weight 

The force of attraction to the Earth causes all bodies to fall with 
the same acceleration relative to the Earth’s surface, which is desig- 
nated by the symbol g. This signifies that in a reference frame associ- 
ated with the Earth, any body of mass m is acted upon by the force 

$$\mathbf{P} = m\mathbf{g} \tag{2.38}$$

called the force of gravity*. When a body is at rest relative to the
Earth’s surface, the force $\mathbf{P}$ is balanced by the reaction**
$\mathbf{F}_r$ of the suspension or support preventing falling of the
body ($\mathbf{F}_r = -\mathbf{P}$). According to Newton’s third law,
the body in this case acts on the suspension or support with the force
$\mathbf{W}$ equal to $-\mathbf{F}_r$, i.e. with the force

$$\mathbf{W} = \mathbf{P} = m\mathbf{g}$$

* Owing to the non-inertial nature of a reference frame associated with
the Earth, the force of gravity will differ somewhat from the force
with which a body is attracted to the Earth. This will be treated in
greater detail in Sec. 4.2.

** Reactions are forces with which a given body is acted upon by bodies
restricting its motion.

The force $\mathbf{W}$ with which a body acts on its suspension or
support is called the weight of the body. This force equals
$m\mathbf{g}$ only when the body and its support (or suspension) are
stationary relative to the Earth. If they are moving with a certain
acceleration $\mathbf{a}$, the weight $\mathbf{W}$ will not equal
$m\mathbf{g}$. This can be explained by the following example. Let
a suspension in the form of a spring fastened to a frame move together
with a body with the acceleration $\mathbf{a}$ (Fig. 2.11). The
equation of motion of the body will therefore have the form

$$\mathbf{P} + \mathbf{F}_r = m\mathbf{a} \tag{2.39}$$

where $\mathbf{F}_r$ is the reaction of the suspension, i.e. the force
with which the spring acts on the body. According to Newton’s third
law, the body acts on the spring with a force equal to $-\mathbf{F}_r$,
which by definition is the weight of the body $\mathbf{W}$ in these
conditions. Substituting the force $-\mathbf{W}$ for the reaction
$\mathbf{F}_r$ and the product $m\mathbf{g}$ for the force of gravity
$\mathbf{P}$ in Eq. (2.39), we get

$$\mathbf{W} = m(\mathbf{g} - \mathbf{a}) \tag{2.40}$$

Formula (2.40) determines the weight of a body in the general case. 
It holds for a suspension or a support of any kind. 

Let us assume that our body and the suspension are moving in
a vertical direction (Fig. 2.11 is based on this assumption).

We project Eq. (2.40) onto the direction of a plumb line:

$$W = m(g \pm a) \tag{2.41}$$

In this expression, $W$, $g$, and $a$ are the magnitudes of the
corresponding vectors. The plus sign corresponds to $\mathbf{a}$
directed upward, and the minus sign to $\mathbf{a}$ directed downward.
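
To make the sign convention of Eq. (2.41) concrete, the following minimal sketch computes the magnitude of the weight for a body in a vertically accelerating lift. The function name, the 70 kg mass, and the value g = 9.81 m/s² are assumptions chosen for illustration; the acceleration is passed as a signed number (positive upward), so the plus and minus cases of Eq. (2.41) are covered by one expression.

```python
G = 9.81  # m/s^2, assumed value of the free-fall acceleration


def weight_magnitude(mass, a_up, g=G):
    """Eq. (2.41): W = m (g + a_up), with a_up > 0 upward and a_up < 0 downward."""
    return mass * (g + a_up)


mass = 70.0  # kg, hypothetical passenger
for label, a_up in (("at rest", 0.0),
                    ("accelerating upward", 2.0),
                    ("accelerating downward", -2.0),
                    ("free fall", -G)):
    print(f"{label:22s} W = {weight_magnitude(mass, a_up):7.1f} N")
```

The last case reproduces the state of weightlessness: with the acceleration equal to g and directed downward, the weight vanishes while the force of gravity mg is unchanged.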

It follows from Eq. (2.41) that the magnitude of the weight $W$
may be either greater or smaller than the force of gravity $P$. In
free fall of the frame with the suspension, $a = g$, and the force
$\mathbf{W}$ with which the body acts on the suspension vanishes.
A state of weightlessness sets in. A spaceship orbiting around the
Earth with its engines switched off travels, like the freely falling
frame, with the acceleration $\mathbf{g}$. As a result, the bodies
inside it are in a state of weightlessness: they exert no pressure on
the bodies in contact with them.

It must be noted that the force of gravity P is often confused with 
the weight of a body W. This is due to the fact that with a station- 
ary support, the forces P and W coincide both in magnitude and in 
direction (they both equal mg). It must be remembered, however, 
that these forces are applied to different bodies: P is applied to a body 
itself, whereas W is applied to the suspension or support restricting 
the free motion of the body in the field of the Earth’s gravitational 
forces. In addition, the force P always equals mg regardless of wheth- 
er the body is moving or at rest, whereas the force of weight W 
depends on the acceleration with which the support and the body are 
moving. It may be either greater or smaller than mg, and, in partic- 
ular, in a state of weightlessness it vanishes. 

The relation (2.40) between the mass and the weight of a body 
provides a way of comparing the masses of bodies by weighing them: 
the ratio of the weights of bodies determined in identical conditions 
(usually at a = 0) at the same point on the Earth’s surface equals 
the ratio of the masses of these bodies: 

$$W_1 : W_2 : W_3 : \ldots = m_1 : m_2 : m_3 : \ldots$$

It will be shown in Sec. 4.2 that the acceleration of free fall g and 
the force of gravity P depend on the latitude of a locality. They also 
depend on the altitude, and diminish with an increasing distance 
from the centre of the Earth. 


2.12. Practical Application 
of Newton’s Laws 

To compile an equation of motion, we must first of all establish
what forces act on the body being considered. It is necessary to
determine the action of other bodies on the given one that must be
taken into account. For example, for a body sliding down an inclined
plane (Fig. 2.12), the action exercised by the Earth is important (it
is characterized by the force $m\mathbf{g}$), and also the action
exercised by the plane (it is characterized by the force of the
reaction $\mathbf{F}_r$).

Never take into account “moving”, “rolling down”, “centripetal”,
“centrifugal”* and similar forces. To prevent an error, characterize
a force according to the “source” causing it to appear, and not
according to the action it produces. This means that behind every force
we must see the body whose action sets up the force. This will
eliminate the typical error consisting in that the same force is taken
into account twice under different names.

In the example we are considering (see Fig. 2.12), it is good to
resolve the force of the reaction $\mathbf{F}_r$ into two components:
the normal force $\mathbf{F}_n$ and the friction force
$\mathbf{F}_{fr}$. This, in particular, is useful in connection with
the fact that the force of friction is proportional to the magnitude of
the force $\mathbf{F}_n$ [see Eq. (2.35)].

Having determined the forces acting on a body, we write an equation of
Newton’s second law. In our example, it has the form

$$m\mathbf{a} = m\mathbf{g} + \mathbf{F}_r = m\mathbf{g} + \mathbf{F}_n + \mathbf{F}_{fr} \tag{2.42}$$

To perform calculations, we must pass over from vectors to their 
projections onto the correspondingly chosen directions, using the 
following properties of projections: 

(1) equal vectors have identical projections; 

(2) the projection of a vector obtained by multiplying another 
vector by a scalar equals the product of the projection of this second 
vector and the scalar; 

(3) the projection of a sum of vectors equals the sum of the
projections of the vectors being added [see Eq. (1.8)].

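Let us project the vectors in Eq. (2.42) onto the direction $x$ shown
in Fig. 2.12. The projections of the vectors are $a_x = a$ ($a$ is the
magnitude of the vector $\mathbf{a}$), $g_x = g\sin\alpha$,
$F_{n,x} = 0$, $F_{fr,x} = -fF_n = -fmg\cos\alpha$, where $\alpha$ is
the angle of inclination of the plane. (The value
$F_n = mg\cos\alpha$ follows from projecting Eq. (2.42) onto the normal
to the plane, along which the body has no acceleration.) Consequently,
we arrive at the equation

$$ma = mg\sin\alpha - fmg\cos\alpha$$

from which it is a simple matter to find $a$.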
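
Dividing the last equation by m gives a = g (sin α − f cos α). The short sketch below evaluates this result; the function name, the angle, the friction coefficients, and g = 9.81 m/s² are illustrative assumptions. The formula describes a body that is already sliding down the plane, so a negative result simply means that friction would decelerate such a body at that angle.

```python
import math


def incline_acceleration(alpha_deg, f, g=9.81):
    """Acceleration along the plane from ma = mg sin(alpha) - f mg cos(alpha)."""
    alpha = math.radians(alpha_deg)
    return g * (math.sin(alpha) - f * math.cos(alpha))


for f in (0.0, 0.2, math.tan(math.radians(30.0))):
    a = incline_acceleration(30.0, f)
    print(f"f = {f:.3f}  a = {a:.3f} m/s^2")
```

With f = tan α the two terms cancel and the body slides at constant speed, which is a convenient check of the projection signs.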

In more complicated cases, we have to project the vectors onto 
several directions and solve the resulting system of algebraic or 
differential equations. 


* This does not relate to the term “centrifugal force of inertia” (see
Sec. 4.2).




CHAPTER 3 LAWS OF CONSERVATION 


3.1. Quantities Obeying
the Laws of Conservation

Bodies forming a mechanical system may interact with one an- 
other and with bodies not belonging to the given system. Accordingly, 
the forces acting on the bodies of a system can be divided into inter- 
nal and external ones. We shall define internal forces as the forces 
with which a given body is acted upon by the other bodies of the 
system, and external forces as those produced by the action of bodies 
not belonging to the system. If external forces are absent, the rele- 
vant system is called closed. 

For closed systems, there are functions of the coordinates and
velocities of the particles* forming the system that retain constant
values during the motion. These functions are called motion integrals.

The number of motion integrals that can be formed for a system
of $N$ particles between which there are no rigid constraints is
$6N - 1$. Only those of them are of interest to us that have the
property of additivity. This property consists in that the value of
a motion integral for a system comprising parts whose interaction may
be disregarded equals the sum of the values for each part. There are
three additive motion integrals. The first is called energy, the second
momentum, and the third angular momentum.

Thus, three physical quantities do not change in closed systems, 
namely, energy, momentum, and angular momentum. Accordingly, 
there are three laws of conservation — that of energy conservation, 
that of momentum conservation, and that of angular momentum 
conservation. These laws are closely associated with the fundamental 
properties of space and time. 

The conservation of energy is based on the uniformity of time,
i.e. the equivalence of all moments of time. The equivalence should
be understood in the sense that the substitution of the moment of
time $t_2$ for the moment $t_1$ without a change in the values of the
coordinates and velocities of the particles does not change the
mechanical properties of a system. This signifies that after such
a substitution, the coordinates and velocities of the particles have
the same values at any moment of time $t_2 + t$ as they would have had
before the substitution at the moment $t_1 + t$.

The conservation of momentum is based on the uniformity of
space, i.e. the identical properties of space at all points. This should
be understood in the sense that a translation of a closed system from
one place in space to another without changing the mutual arrangement
and velocities of the particles does not change the mechanical
properties of the system (it is assumed that the closed nature of the
system is not violated at the new place).

* We remind our reader that by a particle here we mean a point particle.

Finally, the conservation of angular momentum is based on the 
isotropy of space, i.e. the identical properties of space in all direc- 
tions. This should be understood in the sense that rotation of a 
closed system as a whole does not affect its mechanical properties. 

The laws of conservation are a powerful means of research. It 
is often extremely difficult to accurately solve equations of motion. 
In these cases, the laws of conservation permit us to obtain numerous 
important data on how mechanical phenomena proceed without 
having to solve equations of motion. The laws of conservation do 
not depend on the nature of the acting forces. This is why they can 
help us obtain much important information on the behaviour of 
mechanical systems even when the forces are unknown. 

In the following sections, we shall obtain the laws of conservation 
on the basis of Newton’s equations. It must be borne in mind, how- 
ever, that the laws of conservation have a much more general nature 
than Newton’s laws. The laws of conservation remain strictly correct 
even when Newton’s laws (particularly the third one) are violated. 
We stress the fact that the laws of energy, momentum, and angular 
momentum conservation are accurate laws that are also strictly 
obeyed in the relativistic realm.


3.2. Kinetic Energy 

Let us now pass over to finding the additive integrals of motion. 
We shall first consider the simplest system consisting of a single 
point particle. 

The equation of motion of the particle is

$$m\dot{\mathbf{v}} = \mathbf{F} \tag{3.1}$$

Here $\mathbf{F}$ is the resultant of the forces acting on the particle.
Multiplying Eq. (3.1) by the displacement of the particle
$d\mathbf{s} = \mathbf{v}\,dt$, we get

$$m\dot{\mathbf{v}}\mathbf{v}\,dt = \mathbf{F}\,d\mathbf{s} \tag{3.2}$$

The product $\dot{\mathbf{v}}\,dt$ is the increment of the velocity of
the particle $d\mathbf{v}$ during the time $dt$. Accordingly

$$m\dot{\mathbf{v}}\mathbf{v}\,dt = m\mathbf{v}\,d\mathbf{v} = m\,d\left(\frac{v^2}{2}\right) = d\left(\frac{mv^2}{2}\right) \tag{3.3}$$

[see Eq. (2.50)]. Performing such a substitution in Eq. (3.2), we
arrive at the expression

$$d\left(\frac{mv^2}{2}\right) = \mathbf{F}\,d\mathbf{s} \tag{3.4}$$

If the system is closed, i.e. $\mathbf{F} = 0$, then $d(mv^2/2) = 0$,
while the quantity

$$E_k = \frac{mv^2}{2} \tag{3.5}$$

itself remains constant. This quantity is called the kinetic energy
of the particle. For an isolated particle, the kinetic energy is an
integral of motion*.

Multiplying the numerator and denominator of Eq. (3.5) by m 
and taking into consideration that the product mv equals the momen- 
tum p of a body, the expression for the kinetic energy can be given 
the form

$$E_k = \frac{p^2}{2m} \tag{3.6}$$


If the force $\mathbf{F}$ acts on a particle, its kinetic energy does
not remain constant. In this case, in accordance with Eq. (3.4), the
increment of the particle’s kinetic energy during the time $dt$ equals
the scalar product $\mathbf{F}\,d\mathbf{s}$ ($d\mathbf{s}$ is the
displacement of the particle during the time $dt$). The quantity

$$dA = \mathbf{F}\,d\mathbf{s} \tag{3.7}$$

is called the work done by the force $\mathbf{F}$ over the path $ds$
($ds$ is the magnitude of the displacement $d\mathbf{s}$). The scalar
product (3.7) can be represented as the product of the projection of
the force onto the direction of the displacement $F_s$ and the
elementary distance $ds$. Consequently, we can write that

$$dA = F_s\,ds \tag{3.8}$$

It is clear from the above that work characterizes the change in ener- 
gy due to the action of a force on a moving particle. 

* For a single isolated particle, any power of the velocity remains
constant. But for a system of several interacting particles, it is
exactly quantities of the form of Eq. (3.5) that are addends in the
additive integral of motion.

Let us integrate Eq. (3.4) along a certain trajectory from point 1
to point 2:

$$\int_1^2 d\left(\frac{mv^2}{2}\right) = \int_1^2 \mathbf{F}\,d\mathbf{s}$$

The left-hand side is the difference between the values of the kinetic
energy at points 2 and 1, i.e. the increment* of the kinetic energy
along path 1-2. Taking this into account, we get:

$$\frac{mv_2^2}{2} - \frac{mv_1^2}{2} = \int_1^2 \mathbf{F}\,d\mathbf{s} \tag{3.9}$$

The quantity

$$A = \int_1^2 \mathbf{F}\,d\mathbf{s} = \int_1^2 F_s\,ds \tag{3.10}$$

is the work of the force $\mathbf{F}$ over path 1-2. We shall sometimes
denote this work by the symbol $A_{12}$ instead of $A$.

Thus, the work of the resultant of all the forces acting on a particle
produces an increment of the particle’s kinetic energy:

$$A_{12} = E_{k,2} - E_{k,1} \tag{3.11}$$

It follows from Eq. (3.11) that energy has the same dimension as 
work. Accordingly, energy is measured in the same units as work 
(see the following section). 
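
Equation (3.11) can be checked numerically for motion along a straight line. The sketch below is only an illustration under stated assumptions: the force law `force(x)`, the mass, the initial conditions, and the step size are all invented, and the equation of motion (3.1) is integrated with the velocity Verlet scheme, a standard numerical method that the text itself does not discuss. The work is accumulated as the sum of the elementary contributions F ds from Eq. (3.7).

```python
def force(x):
    """Hypothetical position-dependent force in newtons (illustration only)."""
    return 3.0 - 2.0 * x


def work_and_kinetic_energy_change(m=2.0, x0=0.0, v0=1.0, dt=1e-4, steps=10_000):
    """Integrate m dv/dt = F(x); return (work done, increment of kinetic energy)."""
    x, v = x0, v0
    f = force(x)
    work = 0.0
    for _ in range(steps):
        # One velocity Verlet step for the equation of motion (3.1).
        x_new = x + v * dt + 0.5 * (f / m) * dt * dt
        f_new = force(x_new)
        v_new = v + 0.5 * (f + f_new) / m * dt
        # Elementary work dA = F ds, using the mean force over the step.
        work += 0.5 * (f + f_new) * (x_new - x)
        x, v, f = x_new, v_new, f_new
    delta_ek = 0.5 * m * v**2 - 0.5 * m * v0**2
    return work, delta_ek


work, delta_ek = work_and_kinetic_energy_change()
print(f"A_12 = {work:.6f} J, E_k2 - E_k1 = {delta_ek:.6f} J")
```

The two printed numbers agree to within the discretization error of the scheme, as Eq. (3.11) requires.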


3.3. Work 

Let us consider the quantity that we called work in greater detail.
Equation (3.7) can be written in the form

$$dA = \mathbf{F}\,d\mathbf{s} = F\cos\alpha\,ds \tag{3.12}$$

where $\alpha$ is the angle between the direction of the force and that
of the displacement of the point of application of the force.

If the force and the direction of the displacement make an
acute angle ($\cos\alpha > 0$), the work is positive. If the angle
$\alpha$ is obtuse ($\cos\alpha < 0$), the work is negative. When
$\alpha = \pi/2$, the work equals zero. This shows especially clearly
that the concept of work in mechanics differs appreciably from our
ordinary notion of it. In the ordinary meaning, any effort,
particularly muscular strain, is always attended by work being done.
For example, in order to hold a heavy load while standing still, and,
moreover, to carry this load along a horizontal path, a porter spends
much effort, i.e. “does work”. The work as a mechanical quantity in
these cases, however, equals zero.

* The change in a quantity $a$ can be characterized either by its
increment or its decrement. The increment of the quantity $a$, which we
shall designate by $\Delta a$, is defined as the difference between the
final ($a_2$) and initial ($a_1$) values of this quantity:

increment = $\Delta a = a_2 - a_1$

The decrement of the quantity $a$ is the difference between its initial
($a_1$) and final ($a_2$) values:

decrement = $a_1 - a_2 = -\Delta a$

The decrement of a quantity equals its increment with the opposite sign.
The increment and decrement are algebraic quantities.

Figure 3.1 is a plot of the projection of the force onto the direction
of displacement $F_s$ as a function of the position of the particle on
its trajectory (the axis of abscissas has been taken as the $s$-axis,
the length of the part of this axis between points 1 and 2 equals the
total length of the path). Examination of the figure shows that the
elementary work $dA = F_s\,ds$ equals numerically the area of the
hatched strip, while the work $A$ over path 1-2 equals numerically the
area of the figure confined by the curve $F_s$, the vertical lines from
points 1 and 2, and the $s$-axis (compare with Fig. 1.26).

Let us use this result to find the work done in the deformation of
a spring obeying Hooke’s law [see Fig. 2.5 and Eq. (2.26)]. We shall
begin with stretching of the spring. We shall do this very slowly so
that the force $\mathbf{F}_{ext}$ with which we act on the spring may
be considered equal in magnitude to the elastic force $\mathbf{F}_{el}$
all the time. Hence, $F_{x,ext} = -F_{x,el} = kx$, where $x$ is the
elongation of the spring (Fig. 3.2). A glance at the figure shows that
the work required to cause the elongation $x$ of the spring is

$$A = \frac{kx^2}{2} \tag{3.13}$$


When the spring is compressed by the amount $x$, work of the same
magnitude and sign is done as in stretching by $x$. The projection of
the force $\mathbf{F}_{ext}$ in this case is negative
($\mathbf{F}_{ext}$ is directed to the left, $x$ grows to the right,
see Fig. 3.2), and all the increments $dx$ are also negative.
As a result, the product $F_{x,ext}\,dx$ is positive.
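
The area argument behind Eq. (3.13), including the sign reasoning for compression, can be reproduced by summing the elementary work of the external force directly. This is a minimal sketch: the function name, the stiffness of 200 N/m, and the 0.1 m deformation are arbitrary illustrative choices.

```python
def spring_work(k, x, n=1000):
    """Sum F_ext dx with F_ext = k x' while x' goes from 0 to x (midpoint rule)."""
    dx = x / n  # negative for compression, so every step dx is negative too
    total = 0.0
    for i in range(n):
        x_mid = (i + 0.5) * dx      # deformation in the middle of the step
        total += k * x_mid * dx     # elementary work of the external force
    return total


k = 200.0  # N/m, hypothetical stiffness
print(spring_work(k, 0.1), spring_work(k, -0.1), k * 0.1**2 / 2)
```

Both stretching and compression by the same amount give the same positive work, equal to kx²/2, because the force projection and the increments dx change sign together.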

In a similar way, we can find an expression for the work done
upon the elastic stretching or compression of a bar. According to
Eq. (2.31), this work is

$$A = \frac{1}{2}\,\frac{ES}{l_0}\,(\Delta l)^2 = \frac{1}{2}\,ESl_0\left(\frac{\Delta l}{l_0}\right)^2 = \frac{1}{2}\,EV\varepsilon^2 \tag{3.14}$$

where $V = Sl_0$ is the volume of the bar, and
$\varepsilon = \Delta l/l_0$ is the relative elongation [see
Eq. (2.27)].

Assume that several forces whose resultant is
$\mathbf{F} = \sum_i \mathbf{F}_i$ act simultaneously on a body. It
follows from the distributivity of a scalar product of vectors [see
Eq. (1.20)] that the work $dA$ done by the resultant force over the
path $ds$ can be represented in the form

$$dA = \left(\sum_i \mathbf{F}_i\right) d\mathbf{s} = \sum_i \mathbf{F}_i\,d\mathbf{s} = \sum_i dA_i \tag{3.15}$$

This signifies that the work of the resultant of several forces equals
the algebraic sum of the work done by each force separately.



Fig. 3.2 


The elementary displacement $d\mathbf{s}$ can be represented as
$\mathbf{v}\,dt$. We can therefore write the expression for the
elementary work in the form

$$dA = \mathbf{F}\mathbf{v}\,dt \tag{3.16}$$

The work done during the interval from $t_1$ to $t_2$ can thus be
calculated by the formula

$$A = \int_{t_1}^{t_2} \mathbf{F}\mathbf{v}\,dt \tag{3.17}$$

In accordance with Eq. (1.21), we have
$\mathbf{F}\,d\mathbf{s} = F\,ds_F$, where $ds_F$ is the projection of
the elementary displacement $d\mathbf{s}$ onto the direction of the
force $\mathbf{F}$. The formula for work can therefore be written as
follows:

$$dA = F\,ds_F \tag{3.18}$$

If the force has a constant magnitude and direction (Fig. 3.3),
then the vector $\mathbf{F}$ in the expression for work may be put
outside the integral. The result is

$$A = \mathbf{F}\int_1^2 d\mathbf{s} = \mathbf{F}\mathbf{s} = Fs_F \tag{3.19}$$

where $\mathbf{s}$ is the vector of the displacement from point 1 to
point 2, and $s_F$ is its projection onto the direction of the force.

The work done in unit time is called power. If the work $dA$ is
done in the time $dt$, then the power is

$$P = \frac{dA}{dt} \tag{3.20}$$

Taking $dA$ as given by Eq. (3.16), we get the following expression
for the power:

$$P = \mathbf{F}\mathbf{v} \tag{3.21}$$

according to which the power equals the scalar product of the force
vector and the vector of the velocity with which the point of
application of the force is moving.

Units of Work and Power. The unit of work is 
the work done by a force equal to unity and 
acting in the direction of the displacement over 
a unit distance. Consequently, 

(1) in the SI system, the unit of work is the joule (J): the work done
by a force of 1 N over a distance of 1 m;

(2) in the cgs system, the relevant unit is the erg: the work done
by a force of 1 dyn over a distance of 1 cm;

(3) in the mkg(force)s system, the unit is the kilogramme(force) ×
metre (kgf·m): the work done by a force of 1 kgf over a distance of
1 m.

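The units of work are related as follows:

$$1\ \text{J} = 1\ \text{N} \times 1\ \text{m} = 10^5\ \text{dyn} \times 10^2\ \text{cm} = 10^7\ \text{erg}$$
$$1\ \text{kgf}\cdot\text{m} = 1\ \text{kgf} \times 1\ \text{m} = 9.81\ \text{N} \times 1\ \text{m} = 9.81\ \text{J}$$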

The unit of power is the power at which 1 unit of work is done in
unit time. The unit of power in the SI system is the watt (W) equal
to one joule per second (J/s). The unit of power in the cgs system
(erg/s) has no special name. The relation between the watt and the
erg/s is 1 W = $10^7$ erg/s.

The unit of power in the mkg(force)s system is the (metric)
horsepower (hp), equal to 75 kgf·m/s; 1 hp = 736 W (do not confuse this
unit with the British or U.S. horsepower equal to 550 ft-lb/s or
746 W).
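
The unit relations quoted above can be collected in a few conversion helpers. This is a sketch only: the function and constant names are invented for illustration, and the factor 9.81 is the rounded value used in the text (the conventional standard value of the free-fall acceleration is 9.80665 m/s²).

```python
G_ROUNDED = 9.81  # m/s^2, rounded value used in the text

ERG_PER_JOULE = 1e7                       # 1 J = 10^7 erg
JOULE_PER_KGF_M = G_ROUNDED               # 1 kgf*m = 9.81 J
WATT_PER_METRIC_HP = 75 * JOULE_PER_KGF_M  # 1 hp = 75 kgf*m/s


def joule_to_erg(work_j):
    return work_j * ERG_PER_JOULE


def kgf_m_to_joule(work_kgf_m):
    return work_kgf_m * JOULE_PER_KGF_M


def metric_hp_to_watt(power_hp):
    return power_hp * WATT_PER_METRIC_HP


print(joule_to_erg(1.0), kgf_m_to_joule(1.0), round(metric_hp_to_watt(1.0)))
```

Rounding the metric horsepower to the nearest watt gives the 736 W quoted in the text.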

A system of prefixes is used, especially in the SI system, to denote 
multiples and submultiples of units. The names and symbols of 
these prefixes and the relevant factor by which the basic unit is 
multiplied are indicated in Table 3.1. 

Table 3.1. Prefixes for Multiples and Submultiples of Units

Name    Symbol   Factor by which        Name    Symbol   Factor by which
                 unit is multiplied                      unit is multiplied

Tera    T        $10^{12}$              Centi   c        $10^{-2}$
Giga    G        $10^{9}$               Milli   m        $10^{-3}$
Mega    M        $10^{6}$               Micro   μ        $10^{-6}$
Kilo    k        $10^{3}$               Nano    n        $10^{-9}$
Hecto   h        $10^{2}$               Pico    p        $10^{-12}$
Deca    da       $10^{1}$               Femto   f        $10^{-15}$
Deci    d        $10^{-1}$              Atto    a        $10^{-18}$

For example, the unit of work called the megajoule is equivalent
to $10^6$ joules (1 MJ = $10^6$ J), and the unit of power called the
microwatt is equivalent to $10^{-6}$ watt (1 μW = $10^{-6}$ W).
Similarly, 1 micrometer (formerly called the micron) is equivalent to
$10^{-6}$ m (1 μm = $10^{-6}$ m), and 1 pN = $10^{-12}$ N.


3.4. Conservative Forces

If a particle is subjected to the action of other bodies at every
point of space, the particle is said to be in a field of forces. For
example, a particle near the Earth’s surface is in the field of gravity
forces: at every point of space the force $\mathbf{P} = m\mathbf{g}$
acts on it.

Let us consider as a second example the charged particle $e$ in the
electric field set up by the fixed point charge $q$ (Fig. 3.4).
A feature of this field is that the direction of the force acting on
the particle at any point of space passes through a fixed centre (the
charge $q$), while the magnitude of the force depends only on the
distance to this centre, i.e. $F = F(r)$ [see Eq. (2.23)]. A field of
forces with such properties is called a central one.

If at every point of a field the force acting on a particle is identi- 
cal in magnitude and direction (F = const), the field is called 
homogeneous. 

A field that changes with time is called non-stationary. A field
that remains constant with time is called stationary.

For a stationary field, the work done on a particle by the forces
of the field may depend only on the initial and final positions of the
particle and not depend on the path along which the particle moved.
Forces having such a property are called conservative.



Fig. 3.4

It follows from the work of conservative forces being independent
of the path that the work of such forces along a closed path equals
zero. To prove this, let us divide an arbitrary closed path into two
parts: path I along which a particle passes from point 1 to point 2,
and path II along which the particle passes from point 2 to point 1
(Fig. 3.5). We have chosen points 1 and 2 arbitrarily. The work along
the entire closed path equals the sum of the work done on each of
the parts:

$$A = (A_{12})_{\mathrm{I}} + (A_{21})_{\mathrm{II}} \tag{3.22}$$

It is easy to see that the work $(A_{21})_{\mathrm{II}}$ differs from
$(A_{12})_{\mathrm{II}}$ only in its sign. Indeed, reversing of the
direction of motion results in $d\mathbf{s}$ being replaced with
$-d\mathbf{s}$, and as a consequence the value of the integral
$\int \mathbf{F}\,d\mathbf{s}$ reverses its sign. Thus, Eq. (3.22) can
be written in the form

$$A = (A_{12})_{\mathrm{I}} - (A_{12})_{\mathrm{II}}$$

and since the work does not depend on the path, i.e.
$(A_{12})_{\mathrm{I}} = (A_{12})_{\mathrm{II}}$, we arrive at the
conclusion that $A = 0$.

From the equality to zero of the work over a closed path, it is
easy to obtain that the work $A_{12}$ is independent of the path. This
can be done by reversing the above reasoning.

Thus, conservative forces can be defined in two ways: (1) as forces 
whose work does not depend on the path along which a particle 
passes from one point to another, and (2) as forces whose work along 
any closed path equals zero. 

We shall prove that the force of gravity is conservative. This force
at any point has the same magnitude and the same direction, vertically
downward (Fig. 3.6). Therefore, regardless of the path along which
the particle moves (for example I or II in the figure), the work
$A_{12}$ according to Eq. (3.19) is determined by the expression

$$A_{12} = m\mathbf{g}\mathbf{s}_{12} = mg\,(s_{12})_{\text{pr},\,g}$$

Inspection of Fig. 3.6 shows that the projection of the vector
$\mathbf{s}_{12}$ onto the direction $\mathbf{g}$ equals the difference
between the heights $h_1 - h_2$. Hence, the expression for the work can
be written in the form

$$A_{12} = mg\,(h_1 - h_2) \tag{3.23}$$

This expression obviously does not depend on the path. Hence it
follows that the force of gravity is conservative.

It is a simple matter to see that the same result is obtained for 
any stationary homogeneous field. 

The forces acting on a particle in a central field are also
conservative. By Eq. (3.18), the elementary work over the path
$d\mathbf{s}$ (Fig. 3.7) is

$$dA = F(r)\,ds_F$$

But the projection of $d\mathbf{s}$ onto the direction of the force at
a given point, i.e. onto the direction of the position vector
$\mathbf{r}$, is $dr$, the increment of the distance from the particle
to the force centre $O$; namely, $ds_F = dr$. Hence,
$dA = F(r)\,dr$, and the work along the entire path is

$$A_{12} = \int_{r_1}^{r_2} F(r)\,dr \tag{3.24}$$

Fig. 3.7

Equation (3.24) depends only on the form of the function $F(r)$ and on
the values of $r_1$ and $r_2$. It does not depend in any way on the
form of the trajectory, whence it follows that the forces are
conservative.
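
The path independence expressed by Eq. (3.24) can be observed numerically. In the sketch below, the force law F(r) = k/r² with k = 1, the two end points, and the intermediate point of the second path are assumptions made purely for illustration. The work is summed as F ds over small straight steps along two different broken lines joining the same end points and compared with the integral of F(r) dr, which for this force law equals k (1/r₁ − 1/r₂).

```python
import math


def central_force(x, y, k=1.0):
    """Force of magnitude F(r) = k / r**2 directed along the position vector."""
    r = math.hypot(x, y)
    f = k / r**2
    return f * x / r, f * y / r


def work_along(points, n=20_000):
    """Sum F ds over a broken line through `points` (midpoint rule per segment)."""
    total = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        dx, dy = (x1 - x0) / n, (y1 - y0) / n
        for i in range(n):
            fx, fy = central_force(x0 + (i + 0.5) * dx, y0 + (i + 0.5) * dy)
            total += fx * dx + fy * dy
    return total


p1, p2 = (1.0, 0.0), (0.0, 2.0)            # distances r_1 = 1 and r_2 = 2
direct = work_along([p1, p2])               # straight path from 1 to 2
detour = work_along([p1, (3.0, 3.0), p2])   # path through a distant point
exact = 1.0 / 1.0 - 1.0 / 2.0               # integral of k / r**2 dr with k = 1
print(direct, detour, exact)
```

Both paths give the same work to within the accuracy of the summation, although the second one is several times longer than the first.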

For our reader not to form the erroneous idea that any force depending
only on the coordinates of a point is conservative, let us consider the
following example. Assume that the components of a force are determined
by the equations

$$F_x = ay, \quad F_y = -ax, \quad F_z = 0 \tag{3.25}$$

This force has a magnitude equal to $F = ar$, and is directed along
a tangent to a circle of radius $r$ (Fig. 3.8). Indeed, as follows from
the figure, for a force of such a magnitude and direction, we have

$$F_x = ar\cos\left(\frac{\pi}{2} - \alpha\right) = ar\sin\alpha = ar\,\frac{y}{r} = ay,$$
$$F_y = ar\cos(\pi - \alpha) = -ar\cos\alpha = -ar\,\frac{x}{r} = -ax$$

which coincides with the values given by Eqs. (3.25). Let us take
a closed path in the form of a circle of radius $r$ with its centre at
the origin of coordinates. The work of the force along this path
evidently equals $F \times 2\pi r = ar \times 2\pi r = 2\pi ar^2$,
i.e. does not equal zero. Consequently, the force is not conservative.
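
The value 2πar² can be confirmed by summing F ds around the circle. The sketch assumes a > 0 and traverses the circle clockwise, which is the direction in which the force (3.25) points, so that the force and the displacement are parallel at every step; the numerical values of a and r are arbitrary. Traversing the circle counterclockwise would give the same magnitude with the opposite sign.

```python
import math


def loop_work(a, r, n=10_000):
    """Work of F = (a*y, -a*x) once around a circle of radius r, clockwise."""
    h = 2.0 * math.pi / n
    total = 0.0
    for i in range(n):
        t0, t1, tm = i * h, (i + 1) * h, (i + 0.5) * h
        # Clockwise parametrization: x = r cos t, y = -r sin t.
        x0, y0 = r * math.cos(t0), -r * math.sin(t0)
        x1, y1 = r * math.cos(t1), -r * math.sin(t1)
        xm, ym = r * math.cos(tm), -r * math.sin(tm)
        fx, fy = a * ym, -a * xm          # components from Eqs. (3.25)
        total += fx * (x1 - x0) + fy * (y1 - y0)
    return total


a, r = 2.0, 1.5  # hypothetical values
print(loop_work(a, r), 2.0 * math.pi * a * r**2)
```

The nonzero result for a closed path is exactly the property that distinguishes this force from the conservative ones considered above.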

Forces of friction are typical non-conservative ones. Since the
force of friction $\mathbf{F}$ and the velocity of a particle
$\mathbf{v}$ are directed oppositely*, the work of the force of
friction on each part of the path is negative:

$$dA = \mathbf{F}\,d\mathbf{s} = \mathbf{F}\mathbf{v}\,dt = -Fv\,dt = -F\,ds < 0$$

Therefore, the work along any closed path will also be negative (i.e.
other than zero). Hence it follows that the forces of friction are not
conservative.

It must be noted that a field of conservative forces is a particular
case of a potential force field. A field of forces is called potential
if it can be described with the aid of the function $V(x, y, z, t)$,
whose gradient [see the following section, Eq. (3.31)] determines the
force at each point of the field: $\mathbf{F} = \nabla V$ [compare
with Eq. (3.32)]. The function $V$ is called the potential function or
the potential.

When a potential does not depend explicitly on the time, i.e.
$V = V(x, y, z)$, the potential field is stationary, and its forces are
conservative. In this case

$$V(x, y, z) = -E_p(x, y, z)$$

where $E_p(x, y, z)$ is the potential energy of a particle (see the
following section).

For a non-stationary force field described by the potential
$V(x, y, z, t)$, the potential and conservative forces cannot be
considered identical.



Fig. 3.8 


* Here we have in view friction between a moving body and a stationary
(relative to the reference frame) one. The work of the forces of
friction may sometimes be positive. This occurs, for instance, when the
force of friction is due to the interaction of a given body with
another one moving in the same direction, but with a higher velocity.

3.5. Potential Energy
in an External Force Field

Let us consider the case when the work of field forces does not
depend on the path, but depends only on the initial and final
positions of a particle in the field. A value of a certain function
$E_p(x, y, z)$ can be assigned to each point of the field such that the
difference between the values of this function at points 1 and 2 will
determine the work of the forces when the particle passes from the
first point to the second one:

$$A_{12} = E_{p,1} - E_{p,2} \tag{3.26}$$

We can assign this function as follows. We take an arbitrary value
of the function, equal to $E_{p,0}$, for an initial point 0. We assign the
value

$$E_p(P) = E_{p,0} + A_{P0} \tag{3.27}$$

to any other point P. Here $A_{P0}$ is the work done on a particle by the
conservative forces when it is moved from point P to point 0. Since
the work is independent of the path, the value of $E_p(P)$ determined
in this way will be unambiguous. It must be noted that the function
$E_p(P)$ has the dimension of work (or energy).

In accordance with Eq. (3.27), the values of the function at points
1 and 2 are

$$E_{p,1} = E_{p,0} + A_{10}, \qquad E_{p,2} = E_{p,0} + A_{20}$$

Let us form the difference between these values and take into account
that $A_{20} = -A_{02}$ (see the preceding section). As a result,
we get

$$E_{p,1} - E_{p,2} = A_{10} - A_{20} = A_{10} + A_{02}$$

The sum $A_{10} + A_{02}$ gives the work done by the forces of the field
when the particle moves from point 1 to point 2 along a trajectory
passing through point 0. However, the work done to move the particle
from point 1 to point 2 along any other trajectory (including one
not passing through point 0) will be the same. Hence, the sum
$A_{10} + A_{02}$ can be written simply in the form $A_{12}$. As a result, we get
Eq. (3.26).

We can thus use the function $E_p$ to determine the work done on
a particle by conservative forces along any path beginning at
arbitrary point 1 and terminating at arbitrary point 2.
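
The construction of Eq. (3.27) can be illustrated numerically. The sketch below assumes, purely for illustration, a two-dimensional central force F = -k r (a conservative field not taken from the text) and hypothetical helper names `force`, `work` and `potential_energy`. It assigns E_p(P) as the work from P to the reference point 0 and compares the work along two different paths with the difference of E_p.

```python
def force(x, y, k=2.0):
    """Illustrative conservative force F = -k*r (an assumption of this sketch)."""
    return -k * x, -k * y

def work(points, steps=2000):
    """Work of `force` along a polyline through `points` (midpoint rule)."""
    total = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        dx, dy = (x1 - x0) / steps, (y1 - y0) / steps
        for i in range(steps):
            xm, ym = x0 + (i + 0.5) * dx, y0 + (i + 0.5) * dy
            fx, fy = force(xm, ym)
            total += fx * dx + fy * dy
    return total

def potential_energy(p, origin=(0.0, 0.0), e_p0=0.0):
    """E_p(P) = E_p0 + A_P0, Eq. (3.27): work done on the way from P to point 0."""
    return e_p0 + work([p, origin])

if __name__ == "__main__":
    p1, p2 = (1.0, 2.0), (3.0, -1.0)
    a_straight = work([p1, p2])                 # straight path 1 -> 2
    a_bent = work([p1, (3.0, 2.0), p2])         # broken path 1 -> 2
    print(a_straight, a_bent, potential_energy(p1) - potential_energy(p2))
```

For this field the three printed numbers should agree to rounding error, which is the content of Eq. (3.26); changing `e_p0` shifts both values of E_p and leaves their difference unchanged.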

Assume that only conservative forces act on the particle.
Consequently, the work done on the particle along path 1-2 can be
represented in the form of Eq. (3.26). According to Eq. (3.11), this work
produces an increment of the kinetic energy of the particle. We thus
arrive at the equation

$$E_{k,2} - E_{k,1} = E_{p,1} - E_{p,2}$$

whence it follows that

$$E_{k,2} + E_{p,2} = E_{k,1} + E_{p,1}$$

The result obtained signifies that the quantity

$$E = E_k + E_p \tag{3.28}$$

for a particle in the field of conservative forces remains constant,
i.e. is an integral of motion.

It follows from Eq. (3.28) that $E_p$ is an addend in the motion
integral having the dimension of energy. In this connection, the
function $E_p(x, y, z)$ is called the potential energy of a particle in
an external force field. The quantity $E$ equal to the sum of the
kinetic and potential energies is called the total mechanical energy of
the particle.

According to Eq. (3.26), the work done on a particle by conservative
forces equals the decrement of the potential energy of the particle.
We can say in a different way that work is done at the expense
of the store of potential energy.

We can see from Eq. (3.27) that the potential energy is determined
with an accuracy to a certain unknown additive constant $E_{p,0}$.
This circumstance is of no significance, however, because all physical
relations contain either the difference between the values of $E_p$
for two positions of a body, or the derivative of the function $E_p$
with respect to the coordinates. In practice, the potential energy of
a body at a certain position is considered to equal zero, and the
energy at other positions is taken with respect to this energy.

Knowing the form of the function $E_p(x, y, z)$, we can find the force
acting on a particle at every point of a field. Let us consider the
displacement of a particle parallel to the x-axis by the amount $dx$. This
displacement is attended by work being done on the particle that is
$dA = \mathbf{F}\cdot d\mathbf{s} = F_x\,dx$ (the displacement components $dy$ and $dz$ equal
zero). According to Eq. (3.26), the same work can be represented as
the decrement of the potential energy: $dA = -dE_p$. Equating the
two expressions for the work, we obtain

$$F_x\,dx = -dE_p$$

whence

$$F_x = -\frac{dE_p}{dx} \quad (y = \text{const},\ z = \text{const})$$

The expression in the right-hand side is the derivative of the
function $E_p(x, y, z)$ calculated on the assumption that the variables $y$ and
$z$ remain constant, and only the variable $x$ changes. Such derivatives
are called partial ones and are denoted, unlike derivatives of functions
of one variable, by the symbol $\partial E_p/\partial x$. Consequently, the component
of the force along the x-axis equals the partial derivative of the
potential energy with respect to the variable $x$ taken with the
opposite sign: $F_x = -\partial E_p/\partial x$. Similar expressions are obtained for the
components of the force along the y- and z-axes. Thus,


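$$F_x = -\frac{\partial E_p}{\partial x}, \qquad F_y = -\frac{\partial E_p}{\partial y}, \qquad F_z = -\frac{\partial E_p}{\partial z} \tag{3.29}$$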
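Knowing its components, we can find the force vector:

$$\mathbf{F} = F_x\mathbf{e}_x + F_y\mathbf{e}_y + F_z\mathbf{e}_z = -\frac{\partial E_p}{\partial x}\mathbf{e}_x - \frac{\partial E_p}{\partial y}\mathbf{e}_y - \frac{\partial E_p}{\partial z}\mathbf{e}_z \tag{3.30}$$

A vector having the components $\partial\varphi/\partial x$, $\partial\varphi/\partial y$, $\partial\varphi/\partial z$, where $\varphi$ is a
scalar function of the coordinates $x$, $y$, $z$, is called the gradient of the
function $\varphi$ and is designated by the symbol grad $\varphi$ or $\nabla\varphi$ ($\nabla$ stands for
the nabla operator). It follows from the definition of the gradient that

$$\nabla\varphi = \frac{\partial\varphi}{\partial x}\mathbf{e}_x + \frac{\partial\varphi}{\partial y}\mathbf{e}_y + \frac{\partial\varphi}{\partial z}\mathbf{e}_z \tag{3.31}$$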


A comparison of Eqs. (3.30) and (3.31) shows that the conservative 
force equals the gradient of the potential energy taken with the 
opposite sign: 

$$\mathbf{F} = -\nabla E_p \tag{3.32}$$
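
The relation between the force and the gradient of the potential energy can be tried out numerically. The sketch below approximates the partial derivatives of Eqs. (3.29) by central differences; the function names and the sample potential energy m g z (with illustrative values of m and g) are assumptions of this example, not part of the text.

```python
def force_from_potential(e_p, x, y, z, h=1e-5):
    """Approximate F = -grad E_p at (x, y, z) by central differences."""
    fx = -(e_p(x + h, y, z) - e_p(x - h, y, z)) / (2 * h)
    fy = -(e_p(x, y + h, z) - e_p(x, y - h, z)) / (2 * h)
    fz = -(e_p(x, y, z + h) - e_p(x, y, z - h)) / (2 * h)
    return fx, fy, fz

def e_p_gravity(x, y, z, m=2.0, g=9.81):
    """Sample potential energy m*g*z with the z-axis directed upward."""
    return m * g * z

if __name__ == "__main__":
    # Expect a force of magnitude m*g directed along the negative z-axis.
    print(force_from_potential(e_p_gravity, 1.0, 2.0, 3.0))
```

The step `h` is a compromise: a much larger value worsens the approximation for curved potentials, while a much smaller one amplifies rounding error.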

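Assume that a particle which the force (3.32) acts on moves
over the distance $d\mathbf{s}$ having the components $dx$, $dy$, $dz$. The force does
the work

$$dA = \mathbf{F}\cdot d\mathbf{s} = -\nabla E_p\cdot d\mathbf{s} = -\left(\frac{\partial E_p}{\partial x}dx + \frac{\partial E_p}{\partial y}dy + \frac{\partial E_p}{\partial z}dz\right)$$

Taking into account that $dA = -dE_p$, we get the following
expression for the increment of the function $E_p$:

$$dE_p = \frac{\partial E_p}{\partial x}dx + \frac{\partial E_p}{\partial y}dy + \frac{\partial E_p}{\partial z}dz \tag{3.33}$$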

An expression such as Eq. (3.33) is called the total differential of 
the relevant function. 

The concept of the total differential plays a great part in physics.
For this reason, we shall devote a few lines to it. The total differential
of the single-valued function $f(x, y, z)$ is defined as the increment
which this function receives in transition from a point with the
coordinates $x$, $y$, $z$ to a neighbouring point with the coordinates
$x + dx$, $y + dy$, $z + dz$. By definition, this increment equals

$$df(x, y, z) = f(x + dx, y + dy, z + dz) - f(x, y, z)$$

and, consequently, is determined only by the values of the function
at the initial and final points. Hence, it cannot depend on the path
along which the transition occurs. Let us take the broken line
consisting of the segments $dx$, $dy$, $dz$ as such a path (Fig. 3.9). On the
segment $dx$, the function $f(x, y, z)$ behaves like a function of one
variable $x$, and receives the increment $(\partial f/\partial x)\,dx$. Similarly, on the
segments $dy$ and $dz$, the function receives the increments $(\partial f/\partial y)\,dy$
and $(\partial f/\partial z)\,dz$. The total increment of the function when passing from
the initial point to the final one thus equals

$$df(x, y, z) = \frac{\partial f}{\partial x}dx + \frac{\partial f}{\partial y}dy + \frac{\partial f}{\partial z}dz \tag{3.34}$$

We have arrived at the expression for the total differential [compare
with Eq. (3.33)].

Not every expression of the kind

$$P(x, y, z)\,dx + Q(x, y, z)\,dy + R(x, y, z)\,dz$$

is a total differential of a certain function $f(x, y, z)$. In particular, the
expression for the work done by the force whose projections are given
by Eqs. (3.25)

$$dA = ay\,dx - ax\,dy \tag{3.35}$$

is not a total differential because there is no function $E_p$ for
which $-\partial E_p/\partial x = ay$ and $-\partial E_p/\partial y = -ax$ [see Eqs. (3.25)].
Correspondingly, there is no function $E_p$ whose decrement would
determine the work (3.35).
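
One way to see that the expression (3.35) is not a total differential is the standard calculus criterion, which is not stated in the text and is used here as an outside fact: if P dx + Q dy is the total differential of a smooth function, then the cross derivatives ∂P/∂y and ∂Q/∂x coincide. The sketch below evaluates both numerically at one point; the helper names are hypothetical, and agreement at a single point is only a necessary check, not a proof of exactness.

```python
def cross_derivatives(p, q, x, y, h=1e-5):
    """Central-difference estimates of dP/dy and dQ/dx at the point (x, y)."""
    dp_dy = (p(x, y + h) - p(x, y - h)) / (2 * h)
    dq_dx = (q(x + h, y) - q(x - h, y)) / (2 * h)
    return dp_dy, dq_dx

def passes_exactness_check(p, q, x, y, tol=1e-6):
    """True if dP/dy and dQ/dx agree at (x, y) within `tol`."""
    dp_dy, dq_dx = cross_derivatives(p, q, x, y)
    return abs(dp_dy - dq_dx) < tol

if __name__ == "__main__":
    a = 2.0
    # dA = a*y dx - a*x dy, Eq. (3.35): dP/dy = a, dQ/dx = -a.
    print(cross_derivatives(lambda x, y: a * y, lambda x, y: -a * x, 0.3, 0.7))
    # Comparison case: dA = -k*x dx - k*y dy, the work of the central force -k*r.
    print(passes_exactness_check(lambda x, y: -3.0 * x, lambda x, y: -3.0 * y, 0.3, 0.7))
```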

It follows from the above that only forces complying with the
condition (3.32) can be conservative, i.e. forces whose components
along the coordinate axes equal the derivatives of a certain
function $E_p(x, y, z)$ with respect to the relevant coordinates taken
with the opposite sign. This function is the potential energy of
a particle.

The concrete form of the function $E_p(x, y, z)$ depends on the
nature of the force field. Let us find as an example the potential
energy of a particle in a field of forces of gravity. According to
Eq. (3.23), the work done on a particle by the forces of this field is

$$A_{12} = mg(h_1 - h_2)$$

On the other hand, according to Eq. (3.26),

$$A_{12} = E_{p,1} - E_{p,2}$$

Comparing these two expressions for the work, we arrive at the
conclusion that the potential energy of a particle in a field of gravity
forces is determined by the expression

$$E_p = mgh \tag{3.36}$$

where $h$ is measured from an arbitrary level.

The zero of potential energy may be chosen arbitrarily. Therefore,
$E_p$ may have negative values. If we take the potential energy of
a particle on the Earth’s surface as zero, for example, then the
potential energy of a particle lying on the bottom of a pit with a depth of $h'$
will be $E_p = -mgh'$ (Fig. 3.10). It must be noted that the kinetic
energy cannot be negative.

Assume that the non-conservative force $\mathbf{F}^*$ acts on a particle in
addition to conservative forces. Hence, when the particle is moved
from point 1 to point 2, the work done on it will be

$$A_{12} = \int_1^2 \mathbf{F}\cdot d\mathbf{s} + \int_1^2 \mathbf{F}^*\cdot d\mathbf{s} = A_{\text{cons}} + A^*_{12}$$

where $A^*_{12}$ is the work of the non-conservative force. The work of the
conservative forces $A_{\text{cons}}$ can be represented as $E_{p,1} - E_{p,2}$. As a
result, we find that

$$A_{12} = E_{p,1} - E_{p,2} + A^*_{12}$$

The total work of all the forces applied
to the particle produces an increment
of its kinetic energy [see Eq. (3.11)].
Consequently,

$$E_{k,2} - E_{k,1} = E_{p,1} - E_{p,2} + A^*_{12}$$

whence, taking into consideration that
$E_k + E_p = E$, we get

$$E_2 - E_1 = A^*_{12} \tag{3.37}$$

The result obtained signifies that
the work of non-conservative forces
is spent on an increment of the total mechanical energy of a
particle.
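
A short numerical illustration of Eq. (3.37) may help. The sketch below assumes a body sliding over a horizontal rough surface with a constant friction force of magnitude mu*m*g; the coefficient `mu`, the function name and all numerical values are hypothetical. The potential energy does not change on a horizontal surface, so the work of friction equals the change of the kinetic energy.

```python
import math

def slide_with_friction(m, v0, mu, s, g=9.81):
    """Return (A*, E1, E2, v) for a body sliding a distance s against friction.

    A* is the (negative) work of the friction force, E1 and E2 are the total
    mechanical energies before and after, v is the final speed.
    """
    a_star = -mu * m * g * s          # friction force is opposite to the velocity
    e1 = 0.5 * m * v0**2              # potential energy taken as zero on the surface
    e2 = e1 + a_star                  # Eq. (3.37): E2 - E1 = A*
    if e2 < 0:
        raise ValueError("the body stops before covering the distance s")
    v = math.sqrt(2 * e2 / m)
    return a_star, e1, e2, v

if __name__ == "__main__":
    print(slide_with_friction(m=2.0, v0=5.0, mu=0.2, s=3.0))
```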

If the kinetic energy of a particle is the same in its final and initial
positions (in particular, it equals zero), then the work of the
non-conservative forces produces an increment of the potential energy
of the particle:

$$A^*_{12} = E_{p,2} - E_{p,1} \tag{3.38}$$

($E_{k,2} = E_{k,1}$). This relation is useful when finding the difference
between the values of the potential energy.

Let us consider a system consisting of $N$ particles in the field of
conservative forces when the particles do not interact with one
another. Each of the particles has the kinetic energy $E_{k,i} = m_i v_i^2/2$
($i$ is the number of the particle) and the potential energy
$E_{p,i} = E_{p,i}(x_i, y_i, z_i)$. Considering the $i$-th particle independently of the
other particles, we can find that

$$E_i = E_{k,i} + E_{p,i} = \text{const}_i$$

Summing these equations for all the particles, we arrive at the
relation

$$E = \sum_{i=1}^{N} E_i = \sum_{i=1}^{N} E_{k,i} + \sum_{i=1}^{N} E_{p,i} = \text{const} \tag{3.39}$$

This relation points to the additivity of the total mechanical energy
for the system being considered.

According to Eq. (3.39), the total mechanical energy of a system of 
non-interacting particles on which only conservative forces act remains 
constant. This statement expresses the law of energy conservation 
for the above mechanical system. 

If non-conservative forces $\mathbf{F}_i^*$ act on particles in addition to
conservative forces, the total energy of the system does not remain
constant, and

$$E_2 - E_1 = \sum_{i=1}^{N} (A^*_{12})_i \tag{3.40}$$

where $(A^*_{12})_i$ is the work done by the non-conservative force applied
to the $i$-th particle when it moves from its initial position to its
final one.

We established at the end of the preceding section that the work 
of friction forces is always negative. Therefore, when such forces 
are present in a system, the total mechanical energy of the system 
diminishes (dissipates), transforming into non-mechanical forms 
of energy (for example, into the internal energy of bodies, or, as is 
customarily said, into heat). This process is called the dissipation 
of energy. Forces leading to the dissipation of energy are called 
dissipative. Thus, friction forces are dissipative. In general, forces 
that always act oppositely to the velocities of particles and there- 
fore cause their retardation are called dissipative. 

We shall note that non-conservative forces are not necessarily 
dissipative ones. 


3.6. Potential Energy of Interaction 

Up to now, we treated systems of non-interacting particles. Now
we shall pass over to the consideration of a system of two particles
interacting with each other. Let $\mathbf{F}_{12}$ be the force with which the
second particle acts on the first one, and $\mathbf{F}_{21}$ be the force with which the
first particle acts on the second one. In accordance with Newton’s
third law, $\mathbf{F}_{12} = -\mathbf{F}_{21}$.

Let us introduce the vector $\mathbf{R}_{12} = \mathbf{r}_2 - \mathbf{r}_1$, where $\mathbf{r}_1$ and $\mathbf{r}_2$ are
the position vectors of the particles (Fig. 3.11). The distance between
the particles equals the magnitude of this vector. Assume that the
magnitudes of the forces $\mathbf{F}_{12}$ and $\mathbf{F}_{21}$ depend only on the distance
$R_{12}$ between the particles, and that the forces are directed along the
straight line connecting the particles. We know that this holds for
forces of gravitational and Coulomb interactions [see Eqs. (2.18)
and (2.23)].

With these assumptions, the forces $\mathbf{F}_{12}$ and $\mathbf{F}_{21}$ can be represented
in the form

$$\mathbf{F}_{12} = -\mathbf{F}_{21} = f(R_{12})\,\mathbf{e}_{12} \tag{3.41}$$

where $\mathbf{e}_{12}$ is the unit vector of $\mathbf{R}_{12}$ (Fig. 3.12), and $f(R_{12})$ is a certain
function of $R_{12}$ that is positive when the particles attract each other and
negative when they repel each other.



Fig. 3.12


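Considering our system to be closed (there are no external forces),
let us write the equations of motion for our two particles:

$$m_1\dot{\mathbf{v}}_1 = \mathbf{F}_{12}, \qquad m_2\dot{\mathbf{v}}_2 = \mathbf{F}_{21}$$

Let us multiply the first equation by $d\mathbf{r}_1 = \mathbf{v}_1\,dt$, the second by
$d\mathbf{r}_2 = \mathbf{v}_2\,dt$, and add the resulting equations*. We get

$$m_1\mathbf{v}_1\cdot\dot{\mathbf{v}}_1\,dt + m_2\mathbf{v}_2\cdot\dot{\mathbf{v}}_2\,dt = \mathbf{F}_{12}\cdot d\mathbf{r}_1 + \mathbf{F}_{21}\cdot d\mathbf{r}_2 \tag{3.42}$$

The left-hand side of this equation is the increment of the kinetic
energy of the system during the time $dt$ [see Eq. (3.3)], and the
right-hand side is the work of the internal forces during the same time.
Taking into account that $\mathbf{F}_{21} = -\mathbf{F}_{12}$, we can write the
right-hand side as follows:

$$dA_{\text{int}} = \mathbf{F}_{12}\cdot d\mathbf{r}_1 + \mathbf{F}_{21}\cdot d\mathbf{r}_2 = -\mathbf{F}_{12}\cdot d(\mathbf{r}_2 - \mathbf{r}_1) = -\mathbf{F}_{12}\cdot d\mathbf{R}_{12} \tag{3.43}$$

Introducing Eq. (3.41) for $\mathbf{F}_{12}$ into the above equation, we get

$$dA_{\text{int}} = -f(R_{12})\,\mathbf{e}_{12}\cdot d\mathbf{R}_{12}$$

Examination of Fig. 3.12 shows that the scalar product $\mathbf{e}_{12}\cdot d\mathbf{R}_{12}$
equals $dR_{12}$, the increment of the distance between the particles.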

* Here it is expedient to use the symbol $d\mathbf{r}$ for the displacement instead of
$d\mathbf{s}$.

Thus,

$$dA_{\text{int}} = -f(R_{12})\,dR_{12} \tag{3.44}$$

The expression $f(R_{12})\,dR_{12}$ can be considered as the increment of
a certain function of $R_{12}$. Designating this function $E_p(R_{12})$, we
arrive at the equation

$$f(R_{12})\,dR_{12} = dE_p(R_{12}) \tag{3.45}$$

Consequently,

$$dA_{\text{int}} = -dE_p \tag{3.46}$$

With a view to everything said above, Eq. (3.42) can be written in
the form $dE_k = -dE_p$, or

$$dE = d(E_k + E_p) = 0 \tag{3.47}$$

whence it follows that the quantity $E = E_k + E_p$ for the closed
system being considered remains unchanged. The function $E_p(R_{12})$
is the potential energy of interaction. It depends on the distance
between the particles.

Let the particles move from their positions spaced $R_{12}^{(a)}$ apart to
new positions spaced $R_{12}^{(b)}$ apart. In accordance with Eq. (3.46), the
internal forces do the following work on the particles:

$$A_{ab,\text{int}} = -\int_a^b dE_p = E_p[R_{12}^{(a)}] - E_p[R_{12}^{(b)}] \tag{3.48}$$

It follows from Eq. (3.48) that the work of the forces (3.41) does not
depend on the paths of the particles and is determined only by the
initial and final distances between them (the initial and final
configurations of the system). Forces of interaction of the form given by
Eq. (3.41) are thus conservative.

If both particles move, the total energy of the system is

$$E = \frac{m_1 v_1^2}{2} + \frac{m_2 v_2^2}{2} + E_{p,\text{ia}}(R_{12}) \tag{3.49}$$

where $E_{p,\text{ia}}$ is the potential energy of interaction.

Assume that particle 1 is fixed at a certain point which we shall
take as the origin of coordinates ($\mathbf{r}_1 = 0$). As a result, this particle
will lose its ability to move, so that the kinetic energy will consist
only of the single addend $m_2 v_2^2/2$. The potential energy will be a
function only of $r_2$. Therefore, Eq. (3.49) becomes

$$E = \frac{m_2 v_2^2}{2} + E_{p,\text{ia}}(r_2) \tag{3.50}$$

If we consider the system consisting of only the single particle 2,
then the function $E_{p,\text{ia}}$ will play the part of the potential energy of
particle 2 in the field of the forces set up by particle 1. In essence,
however, this function is the potential energy of interaction of
particles 1 and 2. In general, the potential energy in an external field of
forces is essentially the energy of interaction between the bodies of
the system and those producing a force field that is external relative
to the system.

Let us again turn to a system of two interacting free ("unfixed")
particles. If the external force $\mathbf{F}_1^*$ acts on the first particle in addition
to the internal force, and the force $\mathbf{F}_2^*$ on the second particle, then
the addends $\mathbf{F}_1^*\cdot d\mathbf{r}_1$ and $\mathbf{F}_2^*\cdot d\mathbf{r}_2$ will appear in the right-hand side of
Eq. (3.42), and their sum will give the work of the external forces
$dA_{\text{ext}}$. Equation (3.47) will correspondingly become

$$d(E_k + E_{p,\text{ia}}) = dA_{\text{ext}} \tag{3.51}$$

When the total kinetic energy of the particles remains constant
(for example, equals zero), Eq. (3.51) becomes

$$dE_{p,\text{ia}} = dA_{\text{ext}} \tag{3.52}$$

(here $dE_k = 0$). Integration of this equation from configuration $a$ to
configuration $b$ yields

$$E_{p,\text{ia}}[R_{12}^{(b)}] - E_{p,\text{ia}}[R_{12}^{(a)}] = A_{ab,\text{ext}} \tag{3.53}$$

($E_{k,b} = E_{k,a}$) [compare with Eq. (3.38)].

Let us extend the results obtained to a system of three interacting
particles. In this case, the work of the internal forces is

$$dA_{\text{int}} = (\mathbf{F}_{12} + \mathbf{F}_{13})\cdot d\mathbf{r}_1 + (\mathbf{F}_{21} + \mathbf{F}_{23})\cdot d\mathbf{r}_2 + (\mathbf{F}_{31} + \mathbf{F}_{32})\cdot d\mathbf{r}_3 \tag{3.54}$$

Taking into account that $\mathbf{F}_{ik} = -\mathbf{F}_{ki}$, we can write Eq. (3.54)
in the form

$$dA_{\text{int}} = -\mathbf{F}_{12}\cdot d(\mathbf{r}_2 - \mathbf{r}_1) - \mathbf{F}_{13}\cdot d(\mathbf{r}_3 - \mathbf{r}_1) - \mathbf{F}_{23}\cdot d(\mathbf{r}_3 - \mathbf{r}_2) =$$

$$= -\mathbf{F}_{12}\cdot d\mathbf{R}_{12} - \mathbf{F}_{13}\cdot d\mathbf{R}_{13} - \mathbf{F}_{23}\cdot d\mathbf{R}_{23} \tag{3.55}$$

where $\mathbf{R}_{ik} = \mathbf{r}_k - \mathbf{r}_i$.

Let us assume that the internal forces can be represented in the
form $\mathbf{F}_{ik} = f_{ik}(R_{ik})\,\mathbf{e}_{ik}$ [compare with Eq. (3.41)]. Hence,

$$dA_{\text{int}} = -f_{12}(R_{12})\,\mathbf{e}_{12}\cdot d\mathbf{R}_{12} - f_{13}(R_{13})\,\mathbf{e}_{13}\cdot d\mathbf{R}_{13} - f_{23}(R_{23})\,\mathbf{e}_{23}\cdot d\mathbf{R}_{23}$$

Each of the products $\mathbf{e}_{ik}\cdot d\mathbf{R}_{ik}$ equals the increment of the distance
between the corresponding particles $dR_{ik}$. Consequently,

$$dA_{\text{int}} = -f_{12}(R_{12})\,dR_{12} - f_{13}(R_{13})\,dR_{13} - f_{23}(R_{23})\,dR_{23} =$$

$$= -d[E_{p,12}(R_{12}) + E_{p,13}(R_{13}) + E_{p,23}(R_{23})] = -dE_{p,\text{ia}} \tag{3.56}$$

Here

$$E_{p,\text{ia}} = E_{p,12}(R_{12}) + E_{p,13}(R_{13}) + E_{p,23}(R_{23}) \tag{3.57}$$

is the potential energy of interaction of the system. It consists of
the energies of interaction of the particles taken in pairs.

Equating $dE_k$ to the sum of the work $dA_{\text{int}} = -dE_{p,\text{ia}}$ and
$dA_{\text{ext}}$, we arrive at Eq. (3.51) in which by $E_{p,\text{ia}}$ we must understand
Eq. (3.57).

The result obtained is easily generalized for a system with any
number of particles. For a system of $N$ interacting particles, the
potential energy of interaction consists of the energies of interaction of
the particles taken in pairs:

$$E_{p,\text{ia}} = E_{p,12}(R_{12}) + E_{p,13}(R_{13}) + \ldots + E_{p,1N}(R_{1N}) +$$

$$+ E_{p,23}(R_{23}) + \ldots + E_{p,2N}(R_{2N}) + \ldots + E_{p,N-1,N}(R_{N-1,N}) \tag{3.58}$$

This sum can be written as follows:

$$E_{p,\text{ia}} = \sum_{(i<k)} E_{p,ik}(R_{ik}) \tag{3.59}$$

[note that in Eq. (3.58) the first subscript of each addend has a value
smaller than the second one]. In connection with the fact that
$E_{p,ik}(R_{ik}) = E_{p,ki}(R_{ki})$, the energy of interaction can also be
represented in the form

$$E_{p,\text{ia}} = \frac{1}{2}\sum_{(i \neq k)} E_{p,ik}(R_{ik}) \tag{3.60}$$

In the sums (3.59) and (3.60), the subscripts $i$ and $k$ take on values
from 1 to $N$ with observance of the condition that $i < k$ or $i \neq k$.
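
The equivalence of the sums (3.59) and (3.60) is easy to confirm on a small system. The sketch below assumes, for illustration only, a pair energy of the form -a/R (the function names, the constant `a` and the particle positions are hypothetical) and evaluates the interaction energy both ways.

```python
import itertools
import math

def pair_energy(r_i, r_k, a=1.0):
    """Illustrative pair energy -a/R_ik; symmetric in i and k."""
    return -a / math.dist(r_i, r_k)

def energy_sum_i_less_k(points):
    """Eq. (3.59): each pair is counted once (i < k)."""
    return sum(pair_energy(points[i], points[k])
               for i, k in itertools.combinations(range(len(points)), 2))

def energy_half_sum_i_not_k(points):
    """Eq. (3.60): each pair is counted twice (i != k), then halved."""
    n = len(points)
    return 0.5 * sum(pair_energy(points[i], points[k])
                     for i in range(n) for k in range(n) if i != k)

if __name__ == "__main__":
    pts = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 2.0, 0.0), (0.0, 0.0, 3.0)]
    print(energy_sum_i_less_k(pts), energy_half_sum_i_not_k(pts))
```

The first form does half as many evaluations, which is why it is usually preferred in computations; the second is sometimes more convenient in derivations.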

Assume that a system consists of four particles, and that the
first particle interacts only with the second one and the third particle
only with the fourth one. The total energy of this system will be

$$E = E_{k,1} + E_{k,2} + E_{k,3} + E_{k,4} + E_{p,12} + E_{p,34} =$$

$$= (E_{k,1} + E_{k,2} + E_{p,12}) + (E_{k,3} + E_{k,4} + E_{p,34}) = E' + E'' \tag{3.61}$$

Here $E'$ is the total energy of the subsystem formed by particles
1 and 2, and $E''$ is the total energy of the subsystem formed by
particles 3 and 4. In accordance with our assumption, there is no
interaction between the subsystems. Equation (3.61) proves the additivity
of energy (see the third paragraph of Sec. 3.1).

In conclusion, let us find the form of the function $E_{p,\text{ia}}$ for the
case when the force of interaction is inversely proportional to the
square of the distance between the particles:

$$f(R_{12}) = \frac{a}{R_{12}^2} \tag{3.62}$$

($a$ is a constant). We remind our reader that for attraction between
the particles $a > 0$, and for repulsion between them $a < 0$ [see the
text following Eq. (3.41)].

In accordance with Eq. (3.45)

$$dE_{p,\text{ia}} = f(R_{12})\,dR_{12} = \frac{a}{R_{12}^2}\,dR_{12}$$

Integration yields

$$E_{p,\text{ia}} = -\frac{a}{R_{12}} + \text{const} \tag{3.63}$$

Like the potential energy in an external field of forces, the
potential energy of interaction is determined with an accuracy up to an
arbitrary additive constant. It is usually assumed that when
$R_{12} = \infty$, the potential energy becomes equal to zero [at such a distance
the force (3.62) becomes equal to zero, i.e. the interaction
between the particles vanishes]. Hence, the additive constant in
Eq. (3.63) vanishes, and the expression for the potential energy of
interaction acquires the form

$$E_{p,\text{ia}} = -\frac{a}{R_{12}} \tag{3.64}$$

In accordance with Eq. (3.53), the following work must be done
to move the particles away from each other from the distance $R_{12}$ to
infinity without changing their velocities:

$$A_{\text{ext}} = E_{p,\text{ia},\infty} - E_{p,\text{ia}}(R_{12})$$

Introduction of the corresponding values of the function (3.64)
leads to the expression

$$A_{\text{ext}} = 0 - \left(-\frac{a}{R_{12}}\right) = \frac{a}{R_{12}} \tag{3.65}$$

When the particles are attracted to each other, we have $a > 0$;
accordingly, positive work must be done to move the particles away
from each other.

Upon repulsion of the particles from each other, $a < 0$, and the
work (3.65) is negative. This work has to be done to prevent the
particles that are repelling each other from increasing their velocity.
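
The work of separation can also be estimated by direct integration of Eq. (3.52), dA_ext = f(R) dR, with f(R) = a/R^2. The sketch below is an illustration under stated assumptions: the upper limit is a large but finite distance `r_end` standing in for infinity, the substitution t = ln R is a numerical convenience, and the function name is hypothetical.

```python
import math

def work_to_separate(a, r_start, r_end, n=100_000):
    """Midpoint-rule integral of f(R) = a/R**2 from r_start to r_end.

    The integration variable is t = ln R, so that dR = R dt and the steps
    grow together with R.
    """
    t0, t1 = math.log(r_start), math.log(r_end)
    dt = (t1 - t0) / n
    total = 0.0
    for i in range(n):
        r = math.exp(t0 + (i + 0.5) * dt)
        total += (a / r**2) * r * dt      # f(R) dR
    return total

if __name__ == "__main__":
    a, r12 = 3.0, 2.0
    # Compare the numerical value with a/R_12 from the closed-form result.
    print(work_to_separate(a, r12, 1e6 * r12), a / r12)
```

For a negative `a` (repulsion) the same function returns a negative value, in line with the sign discussion above.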


3.7. Law of Conservation of Energy 

Let us combine the results obtained in the preceding sections.
We shall consider a system consisting of $N$ particles of masses
$m_1, m_2, \ldots, m_N$. Assume that the particles interact with one another
with the forces $\mathbf{F}_{ik}$ whose magnitudes depend only on the distance
$R_{ik}$ between the particles. We established in the preceding section
that such forces are conservative. This signifies that the work done by
these forces on the particles is determined by the initial and final
configurations of the system. Assume that the external conservative
force $\mathbf{F}_i$ and the external non-conservative force $\mathbf{F}_i^*$ act on the $i$-th
particle in addition to the internal forces. The equation of motion of
the $i$-th particle will therefore acquire the form

$$m_i\dot{\mathbf{v}}_i = \sum_{\substack{k=1 \\ (k \neq i)}}^{N} \mathbf{F}_{ik} + \mathbf{F}_i + \mathbf{F}_i^* \tag{3.66}$$

where $i = 1, 2, \ldots, N$.

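Multiplying the $i$-th equation by $d\mathbf{s}_i = d\mathbf{r}_i = \mathbf{v}_i\,dt$ and adding
together all the $N$ equations, we get

$$\sum_{i=1}^{N} m_i\mathbf{v}_i\cdot d\mathbf{v}_i = \sum_{i=1}^{N}\Big\{\sum_{\substack{k=1 \\ (k \neq i)}}^{N} \mathbf{F}_{ik}\Big\}\cdot d\mathbf{r}_i + \sum_{i=1}^{N} \mathbf{F}_i\cdot d\mathbf{s}_i + \sum_{i=1}^{N} \mathbf{F}_i^*\cdot d\mathbf{s}_i \tag{3.67}$$

The left-hand side is the increment of the kinetic energy of the
system:

$$\sum_{i=1}^{N} m_i\mathbf{v}_i\cdot d\mathbf{v}_i = d\sum_{i=1}^{N} \frac{m_i v_i^2}{2} = dE_k \tag{3.68}$$

[see Eq. (3.3)]. It follows from Eqs. (3.54)-(3.59) that the first
term of the right-hand side equals the decrement of the potential
energy of interaction of the particles:

$$\sum_{i=1}^{N}\Big\{\sum_{\substack{k=1 \\ (k \neq i)}}^{N} \mathbf{F}_{ik}\Big\}\cdot d\mathbf{r}_i = -\sum_{(i<k)} \mathbf{F}_{ik}\cdot d\mathbf{R}_{ik} = -d\sum_{(i<k)} E_{p,ik}(R_{ik}) = -dE_{p,\text{ia}} \tag{3.69}$$

According to Eq. (3.26), the second term in Eq. (3.67) equals the
decrement of the potential energy of the system in the external
field of the conservative forces:

$$\sum_{i=1}^{N} \mathbf{F}_i\cdot d\mathbf{s}_i = -d\sum_{i=1}^{N} E_{p,i}(\mathbf{r}_i) = -dE_{p,\text{ext}} \tag{3.70}$$

Finally, the last term in Eq. (3.67) is the work of the
non-conservative external forces:

$$\sum_{i=1}^{N} \mathbf{F}_i^*\cdot d\mathbf{s}_i = \sum_{i=1}^{N} dA_i^* = dA^*_{\text{ext}} \tag{3.71}$$

Taking into account equations (3.68)-(3.71), we can write Eq. (3.67)
as follows:

$$d(E_k + E_{p,\text{ia}} + E_{p,\text{ext}}) = dA^*_{\text{ext}} \tag{3.72}$$

The quantity

$$E = E_k + E_{p,\text{ia}} + E_{p,\text{ext}} \tag{3.73}$$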
is the total mechanical energy of the system. If external
non-conservative forces are absent, the right-hand side of Eq. (3.72) will
vanish, and, consequently, the total energy of the system remains
constant:

$$E = E_k + E_{p,\text{ia}} + E_{p,\text{ext}} = \text{const} \tag{3.74}$$

We have thus arrived at the conclusion that the total mechanical
energy of a system of bodies on which only conservative forces act remains
constant. This statement is the essence of one of the fundamental
laws of mechanics, the law of conservation of mechanical energy.
For a closed system, i.e. a system whose bodies experience no
external forces, Eq. (3.74) has the form

$$E = E_k + E_{p,\text{ia}} = \text{const} \tag{3.75}$$

In this case, the law of conservation of energy is formulated as follows:
the total mechanical energy of a closed system of bodies between which
only conservative forces act remains constant.
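
The conservation law for a closed system can be observed in a small simulation. The sketch below is a hedged illustration, not the author's method: it assumes two particles attracting each other with f(R) = a/R^2, arbitrary illustrative masses and initial conditions, and a velocity-Verlet time-stepping scheme chosen here for its good energy behaviour. It records E = E_k + E_p,ia at every step.

```python
import math

def simulate_pair(a=1.0, m1=1.0, m2=2.0, dt=1e-3, steps=5000):
    """Integrate two attracting particles in a plane; return the energy history."""
    r1, r2 = [0.0, 0.0], [1.0, 0.0]
    v1, v2 = [0.0, -0.8], [0.0, 0.4]      # total momentum is zero

    def force_on_1():
        # F_12 = f(R_12) e_12 with f = a / R_12**2, directed from 1 to 2.
        dx, dy = r2[0] - r1[0], r2[1] - r1[1]
        dist = math.hypot(dx, dy)
        f = a / dist**2
        return f * dx / dist, f * dy / dist

    def total_energy():
        dist = math.hypot(r2[0] - r1[0], r2[1] - r1[1])
        e_k = 0.5 * m1 * (v1[0]**2 + v1[1]**2) + 0.5 * m2 * (v2[0]**2 + v2[1]**2)
        return e_k - a / dist             # E_k + E_p,ia with E_p,ia = -a/R_12

    energies = [total_energy()]
    fx, fy = force_on_1()
    for _ in range(steps):
        for c, f in ((0, fx), (1, fy)):
            v1[c] += 0.5 * dt * f / m1    # half kick; F_21 = -F_12
            v2[c] -= 0.5 * dt * f / m2
            r1[c] += dt * v1[c]           # drift
            r2[c] += dt * v2[c]
        fx, fy = force_on_1()             # forces at the new positions
        for c, f in ((0, fx), (1, fy)):
            v1[c] += 0.5 * dt * f / m1    # second half kick
            v2[c] -= 0.5 * dt * f / m2
        energies.append(total_energy())
    return energies

if __name__ == "__main__":
    e = simulate_pair()
    print(e[0], max(abs(x - e[0]) for x in e))
```

The kinetic and potential energies change individually along the orbit, while their sum should stay constant up to the small discretization error of the scheme.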

If non-conservative forces, for example, forces of friction, act in
a closed system in addition to conservative ones, then the total
mechanical energy of the system is not conserved. Considering the
non-conservative forces as external ones, we can write in accordance
with Eq. (3.72) that

$$dE = d(E_k + E_{p,\text{ia}}) = dA_{\text{non-cons}} \tag{3.76}$$

Integration of this equation yields

$$E_2 - E_1 = A_{12,\text{non-cons}} \tag{3.77}$$

The law of energy conservation for a system of non-interacting
particles was formulated in Sec. 3.5 [see the text following Eq. (3.39)].


3.8. Energy of Elastic Deformation 

Not only a system of interacting bodies, but also a separately 
taken elastically deformed body (for example, a compressed spring, 
a stretched rod, etc.) can have potential energy. In this case, the 
latter depends on the mutual arrangement of separate parts of the 
body (for example, on the distance between adjacent coils of a 
spring). 

According to Eq. (3.13), the work $A = kx^2/2$ must be done to stretch
or compress a spring by the amount $x$. This work goes to increase
the potential energy of the spring. Consequently, the potential
energy of a spring depends on the elongation $x$ as follows:

$$E_p = \frac{kx^2}{2} \tag{3.78}$$

where $k$ is the spring constant (see Sec. 2.9). Equation (3.78) is
based on the assumption that the potential energy of an undeformed
spring equals zero. Figure 3.13 shows a plot of $E_p$ against $x$.
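
As a quick check of the expression for the energy of a spring, the work of the external force kx that balances the elastic force can be summed numerically over the deformation. The sketch below uses hypothetical function names and an illustrative spring constant; it also shows that stretching (x > 0) and compressing (x < 0) by the same amount give the same energy.

```python
def spring_work(k, x, n=1000):
    """Work done against the elastic force while the deformation grows from 0 to x."""
    dx = x / n
    total = 0.0
    for i in range(n):
        xm = (i + 0.5) * dx        # midpoint of the i-th step
        total += k * xm * dx       # external force k*x acting over the step dx
    return total

def spring_energy(k, x):
    """Potential energy of the spring, E_p = k*x**2/2."""
    return 0.5 * k * x**2

if __name__ == "__main__":
    k = 50.0
    for x in (0.1, -0.1):
        print(x, spring_work(k, x), spring_energy(k, x))
```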

The work determined by Eq. (3.14) is done in the elastic
longitudinal deformation of a bar or rod. Accordingly, the potential energy
of an elastically deformed rod is

$$E_p = \frac{E\varepsilon^2}{2}\,V \tag{3.79}$$

where $E$ = Young’s modulus
$\varepsilon$ = relative elongation
$V$ = volume of the rod.

Let us introduce the concept of the density of the energy of elastic
deformation $u_e$, which we shall define as the ratio of the energy $dE_p$
to the volume $dV$ in which it is confined:

$$u_e = \frac{dE_p}{dV} \tag{3.80}$$

Fig. 3.13

Since the rod is assumed to be homogeneous and its deformation is
uniform, i.e. identical at different points of the rod, the energy
(3.79) is also distributed uniformly in the rod. We can therefore
consider that

$$u_e = \frac{E_p}{V} = \frac{E\varepsilon^2}{2} \tag{3.81}$$


This expression also gives the density of the energy of elastic
deformation in stretching (or compression) when the deformation is not
uniform. In the latter case, to find the energy density at a certain
point of a rod, the value of $\varepsilon$ at this point must be introduced into
Eq. (3.81).

On the basis of Eqs. (2.32)-(2.34), it is not difficult to obtain the
following equation for the density of the energy of elastic
deformation in shear:

$$u_e = \frac{G\gamma^2}{2} \tag{3.82}$$

where $G$ is the shear modulus and $\gamma$ is the relative shear.
3.9. Equilibrium Conditions
of a Mechanical System

Let us consider a point particle whose motion is restricted so that 
it has only one degree of freedom*. This signifies that its position 
can be determined with the aid of a single quantity, for example the 
coordinate x . We can take as an example a ball sliding without fric- 
tion along a stationary wire bent in a vertical plane (Fig. 3.14a). 




Fig. 3.14 



Another example is a ball attached to the end of a spring and sliding
without friction along a horizontal guide wire (Fig. 3.15a). The ball
is acted upon in each case by a conservative force: the force of
gravity and the elastic force of the deformed spring, respectively. Plots
of the potential energy $E_p$ against $x$ are shown in Figs. 3.14b
and 3.15b.

Since the balls move along the relevant wires without friction,
the force with which the wire acts on the ball in each case is at right
angles to the velocity of the ball and, consequently, does no work on
the ball. Therefore, the energy is conserved:

$$E = E_k + E_p = \text{const} \tag{3.83}$$


* By the number of degrees of freedom of a mechanical system is meant
the number of independent quantities with whose aid the position of the system
can be set. This will be treated in greater detail in Sec. 11.5.




It follows from Eq. (3.83) that the kinetic energy can grow only at
the expense of a reduction in the potential energy. Hence, if a ball
is in a state such that its velocity equals zero and its potential
energy is minimum, it will be unable to start moving without
external action on it, i.e. it will be in equilibrium.

Values of $x$ equal to $x_0$ correspond to minima of $E_p$ in the graphs
(in Fig. 3.15, $x_0$ is the length of the undeformed spring). The
condition of a minimum of the potential energy has the form

$$\frac{dE_p}{dx} = 0 \tag{3.84}$$

In accordance with Eq. (3.32), the condition (3.84) is equivalent to
the fact that

$$F_x = 0 \tag{3.85}$$

(when $E_p$ is a function of only one variable, we have
$\partial E_p/\partial x = dE_p/dx$). Thus, the position corresponding to a minimum of the
potential energy has the property that the force acting on the body
equals zero.

In the case shown in Fig. 3.14, the conditions (3.84) and (3.85)
are also observed for $x$ equal to $x_0'$ (i.e. for a maximum of $E_p$). The
position of the ball determined by this value of $x$ will also be an
equilibrium one. This equilibrium, however, unlike that at $x = x_0$, will
be unstable: it is sufficient to slightly move the ball out of this
position, and a force will appear that will move it away from the
position $x_0'$. The forces appearing when the ball is displaced from its
position of stable equilibrium (for which $x = x_0$) are directed so that
they tend to return the ball to its equilibrium position.
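
The search for equilibrium positions and their classification can be sketched in code. The example below assumes a hypothetical potential energy E_p(x) = 3x^2 - 2x^3, chosen only because it has one minimum and one maximum like the curve described for Fig. 3.14b; the function names are illustrative. It locates the zeros of dE_p/dx by bisection and uses the sign of the derivative on the left of each zero to tell a minimum (stable) from a maximum (unstable).

```python
def e_p(x):
    """Hypothetical potential energy with a minimum at x = 0 and a maximum at x = 1."""
    return 3 * x**2 - 2 * x**3

def de_p_dx(x):
    """Derivative of e_p; the force is F_x = -dE_p/dx."""
    return 6 * x - 6 * x**2

def find_equilibria(x_min, x_max, n=997):
    """Return [(x, 'stable' | 'unstable'), ...] for zeros of dE_p/dx in the interval.

    A zero is detected by a strict sign change between neighbouring grid
    points, so a zero lying exactly on a grid point would be missed.
    """
    found = []
    step = (x_max - x_min) / n
    for i in range(n):
        lo, hi = x_min + i * step, x_min + (i + 1) * step
        d_lo, d_hi = de_p_dx(lo), de_p_dx(hi)
        if d_lo * d_hi >= 0:
            continue
        # dE_p/dx < 0 on the left means F_x > 0 there: the force points
        # back toward the zero, i.e. E_p has a minimum.
        kind = "stable" if d_lo < 0 else "unstable"
        for _ in range(60):                      # bisection
            mid = 0.5 * (lo + hi)
            if de_p_dx(lo) * de_p_dx(mid) <= 0:
                hi = mid
            else:
                lo = mid
        found.append((0.5 * (lo + hi), kind))
    return found

if __name__ == "__main__":
    print(find_equilibria(-0.5, 1.5))
```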

Knowing the form of the function expressing the potential energy,
we can arrive at a number of conclusions on the nature of motion of
a particle. We shall explain this using the graph shown in Fig. 3.14b
to describe the motion of our particle. If the total energy has the
value shown in the figure, then the particle can move either within
the limits from $x_1$ to $x_2$, or within the limits from $x_3$ to infinity. The
particle cannot penetrate into the regions with $x < x_1$ and
$x_2 < x < x_3$ because its potential energy cannot become greater than
its total energy (if this occurred, then the kinetic energy would be
negative). Thus, the region $x_2 < x < x_3$ is a potential barrier
through which the particle cannot penetrate having its given stock
of total energy. The region $x_1 < x < x_2$ is called a potential well.

If a particle in its motion cannot move away to infinity, the mo- 
tion is called finite. If the particle can travel any distance away, the 
motion is called infinite. A particle in a potential well performs 
finite motion. The motion of a particle with a negative total energy 
in the central field of forces of attraction will also be finite (it is 
assumed that the potential energy vanishes at infinity). 


3.10. Law of Momentum Conservation 

In the preceding sections, we considered the additive integral of 
motion called energy. Let us find another additive quantity that is 
conserved for a closed system. For this purpose, we shall consider 
a system of $N$ interacting particles. Assume that external forces 
whose resultant is $\mathbf{F}_i$ act on the $i$-th particle in addition 
to the internal forces $\mathbf{F}_{ik}$. Let us write Eq. (2.10) for all 
the $N$ particles: 

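$$
\begin{aligned}
\dot{\mathbf{p}}_1 &= \mathbf{F}_{12} + \mathbf{F}_{13} + \dots + \mathbf{F}_{1k} + \dots + \mathbf{F}_{1N} + \mathbf{F}_1 = \sum_{k=2}^{N} \mathbf{F}_{1k} + \mathbf{F}_1 \\
\dot{\mathbf{p}}_2 &= \mathbf{F}_{21} + \mathbf{F}_{23} + \dots + \mathbf{F}_{2k} + \dots + \mathbf{F}_{2N} + \mathbf{F}_2 = \sum_{\substack{k=1 \\ (k \ne 2)}}^{N} \mathbf{F}_{2k} + \mathbf{F}_2 \\
&\;\;\vdots \\
\dot{\mathbf{p}}_i &= \mathbf{F}_{i1} + \mathbf{F}_{i2} + \dots + \mathbf{F}_{ik} + \dots + \mathbf{F}_{iN} + \mathbf{F}_i = \sum_{\substack{k=1 \\ (k \ne i)}}^{N} \mathbf{F}_{ik} + \mathbf{F}_i \\
&\;\;\vdots \\
\dot{\mathbf{p}}_N &= \mathbf{F}_{N1} + \mathbf{F}_{N2} + \dots + \mathbf{F}_{Nk} + \dots + \mathbf{F}_{N,N-1} + \mathbf{F}_N = \sum_{k=1}^{N-1} \mathbf{F}_{Nk} + \mathbf{F}_N
\end{aligned}
$$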

Let us find the sum of these $N$ equations. Since 
$\mathbf{F}_{12} + \mathbf{F}_{21} = 0$, etc., only the external forces 
will remain in the right-hand side. We thus arrive at the relation 

$$\frac{d}{dt}(\mathbf{p}_1 + \mathbf{p}_2 + \dots + \mathbf{p}_N) = \mathbf{F}_1 + \mathbf{F}_2 + \dots + \mathbf{F}_N = \sum_{i=1}^{N} \mathbf{F}_i \tag{3.86}$$

The sum of the momenta of the particles forming a mechanical 
system is called the momentum of the system. Denoting this momentum 
by $\mathbf{p}$, we find that 

$$\mathbf{p} = \sum_{i=1}^{N} \mathbf{p}_i = \sum_{i=1}^{N} m_i \mathbf{v}_i \tag{3.87}$$

It follows from Eq. (3.87) that the momentum is an additive quan- 
tity. 

Let us write Eq. (3.86) in the form 

$$\frac{d}{dt}\mathbf{p} = \sum_{i=1}^{N} \mathbf{F}_i \tag{3.88}$$

Hence it follows that in the absence of external forces, 
$d\mathbf{p}/dt = 0$. Consequently, for a closed system, $\mathbf{p}$ is 
constant. This statement forms the content of the law of momentum 
conservation, which is formulated as follows: the momentum of a closed 
system of point particles remains constant. 

It should be noted that the momentum also remains constant for 
an unclosed system provided that the sum of the external forces is 
zero [see Eq. (3.88)]. When the sum of the external forces does not 
equal zero, but the projection of this sum on a certain direction 
does equal zero, the component of the momentum in this direction 
is conserved. Indeed, upon projecting all the quantities of Eq. (3.88) 
onto a certain direction x, we find that 

$$\frac{d}{dt} p_x = \sum_{i=1}^{N} F_{xi} \tag{3.89}$$

whence our statement follows. [We remind our reader that 
$(d\mathbf{p}/dt)_{\mathrm{pr},x} = dp_x/dt$, see Eqs. (1.40)]. 

The momentum of a system of particles can be represented as the 
product of the total mass of the particles and the velocity of the 
centre of mass of the system: 


$$\mathbf{p} = m\mathbf{v}_C \tag{3.90}$$


The centre of mass (or the centre of inertia) of a system is defined 
as the point $C$ whose position is set by the position vector 
$\mathbf{r}_C$ determined as follows: 

$$\mathbf{r}_C = \frac{m_1\mathbf{r}_1 + m_2\mathbf{r}_2 + \dots + m_N\mathbf{r}_N}{m_1 + m_2 + \dots + m_N} = \frac{\sum m_i \mathbf{r}_i}{m} \tag{3.91}$$

where $m_i$ = mass of the $i$-th particle 
$\mathbf{r}_i$ = position vector determining the position of this particle 
$m$ = mass of the system. 

The Cartesian coordinates of the centre of mass equal the projec- 
tions of $\mathbf{r}_C$ onto the coordinate axes: 

$$x_C = \frac{\sum m_i x_i}{m}, \quad y_C = \frac{\sum m_i y_i}{m}, \quad z_C = \frac{\sum m_i z_i}{m} \tag{3.92}$$


It must be noted that in a homogeneous field of gravity forces the 
centre of mass coincides with the centre of gravity of a system. 

We find the velocity of the centre of mass by time differentiation 
of the position vector (3.91): 

$$\mathbf{v}_C = \dot{\mathbf{r}}_C = \frac{\sum m_i \dot{\mathbf{r}}_i}{m} = \frac{\sum m_i \mathbf{v}_i}{m} = \frac{\mathbf{p}}{m}$$

[see Eq. (3.87)]. Hence follows Eq. (3.90). 

For a closed system, $\mathbf{p} = m\mathbf{v}_C = \text{const}$. Therefore, 
the centre of mass of a closed system either moves uniformly in a 
straight line, or remains stationary. 
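
Equations (3.87), (3.90) and (3.91) can be checked on a small system. The sketch below is illustrative only: the three masses, positions and velocities are made-up values (SI units assumed), and vectors are stored as plain 3-tuples.

```python
def centre_of_mass(masses, positions):
    """r_C = sum(m_i * r_i) / m for 3-component position tuples, Eq. (3.91)."""
    total_mass = sum(masses)
    return tuple(
        sum(m * r[axis] for m, r in zip(masses, positions)) / total_mass
        for axis in range(3)
    )


def total_momentum(masses, velocities):
    """p = sum(m_i * v_i), Eq. (3.87)."""
    return tuple(
        sum(m * v[axis] for m, v in zip(masses, velocities))
        for axis in range(3)
    )


# Hypothetical three-particle system.
masses = [1.0, 2.0, 3.0]
positions = [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0), (0.0, 2.0, 0.0)]
velocities = [(6.0, 0.0, 0.0), (0.0, 3.0, 0.0), (0.0, -1.0, 0.0)]

r_c = centre_of_mass(masses, positions)
p = total_momentum(masses, velocities)
# The velocity of the centre of mass is the mass-weighted mean velocity,
# i.e. the same weighted average as in Eq. (3.91) applied to velocities.
v_c = centre_of_mass(masses, velocities)

print("r_C =", r_c)
print("p   =", p)
print("m * v_C =", tuple(sum(masses) * component for component in v_c))
```

The last two printed vectors coincide, which is the content of Eq. (3.90).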

A reference frame in which the centre of mass is at rest is called 
a centre-of-mass frame or a c.m. frame. For a closed system, this 
frame is obviously an inertial one. 

A reference frame associated with measuring instruments is called 
a laboratory frame or an l-frame. 


3.11. Collision of Two Bodies 

When bodies collide with one another, they become deformed. 
The kinetic energy which the bodies had before the collision par- 
tially or completely transforms into the potential energy of elastic 
deformation and into the so-called internal energy of the bodies. An 
increase in the internal energy of bodies is attended by elevation of 
their temperature. 

Two extreme kinds of collisions are distinguished: perfectly 
elastic and completely inelastic ones. A perfectly elastic collision 
is one in which the mechanical energy of the bodies does not trans- 
form into other, non-mechanical kinds of energy. In such a collision, 
the kinetic energy transforms completely or partly into the potential 
energy of elastic deformation. Next the bodies return to their origi- 
nal shape, repelling each other. As a result, the potential energy of 
elastic deformation again transforms into kinetic energy, and the 
bodies fly apart with velocities whose magnitude and direction 
are determined by two conditions: conservation of the total 
energy and conservation of the total momentum of the system of 
bodies. 

A completely inelastic collision is characterized by the fact that 
no potential energy of deformation is produced. The kinetic energy 
of the bodies completely or partly transforms into internal energy. 
After colliding, the bodies either move with the same velocity or 
are at rest. In a completely inelastic collision, only the law of con- 
servation of momentum is observed. The law of conservation of 
mechanical energy is not observed — instead of it the law of conser- 
vation of the total energy of different kinds — mechanical and inter- 
nal — is observed. 

Let us first consider a completely inelastic collision of two parti- 
cles (point particles) forming a closed system. Let the masses of the 
particles be $m_1$ and $m_2$, and their velocities before colliding 
$\mathbf{v}_{10}$ and $\mathbf{v}_{20}$. In view of the law of momentum 
conservation, the total momentum of the particles after the collision 
must be the same as before it: 

$$m_1\mathbf{v}_{10} + m_2\mathbf{v}_{20} = m_1\mathbf{v} + m_2\mathbf{v} = (m_1 + m_2)\,\mathbf{v} \tag{3.93}$$

($\mathbf{v}$ is the identical velocity of both particles after colliding). 
It follows from Eq. (3.93) that 

$$\mathbf{v} = \frac{m_1\mathbf{v}_{10} + m_2\mathbf{v}_{20}}{m_1 + m_2} \tag{3.94}$$


For practical calculations, Eq. (3.94) must be projected onto the 
correspondingly selected directions. 
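
As an illustration of Eq. (3.94) projected onto a single axis, the sketch below computes the common velocity and the kinetic energy converted into internal energy. The masses and velocity projections are assumed values (SI units), not data from the text.

```python
def inelastic_final_velocity(m1, v10, m2, v20):
    """Common velocity after a completely inelastic collision, Eq. (3.94).

    v10 and v20 are projections of the initial velocities onto one axis.
    """
    return (m1 * v10 + m2 * v20) / (m1 + m2)


def kinetic_energy(m, v):
    return 0.5 * m * v**2


# Hypothetical head-on collision: the particles move toward each other.
m1, v10 = 2.0, 3.0
m2, v20 = 1.0, -3.0

v = inelastic_final_velocity(m1, v10, m2, v20)
energy_before = kinetic_energy(m1, v10) + kinetic_energy(m2, v20)
energy_after = kinetic_energy(m1 + m2, v)
energy_to_internal = energy_before - energy_after

print("common velocity:", v)
print("kinetic energy converted to internal energy:", energy_to_internal)
```

Momentum is the same before and after the collision, whereas the kinetic energy is not: the difference is the increase in the internal energy of the bodies.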

Let us now consider a perfectly elastic collision. We shall limit 
ourselves to the case of a central collision of two homogeneous spheres. 
A collision is called central if the spheres before colliding travelled 
along the straight line passing through their centres. A central 
collision of two spheres can take place (1) if the spheres are moving 
toward each other (Fig. 3.16a), or (2) if one of the spheres is over- 
taking the other one (Fig. 3.16b). 

We shall assume that the spheres form a closed system or that the 
external forces applied to them balance each other. We shall also 
assume that the spheres do not rotate. 

Let the masses of the spheres be $m_1$ and $m_2$, the velocities of the 
spheres before the collision be $\mathbf{v}_{10}$ and $\mathbf{v}_{20}$, 
and, finally, the velocities after the collision be $\mathbf{v}_1$ and 
$\mathbf{v}_2$. The equations of conservation of energy and momentum are: 

$$\frac{m_1 v_{10}^2}{2} + \frac{m_2 v_{20}^2}{2} = \frac{m_1 v_1^2}{2} + \frac{m_2 v_2^2}{2} \tag{3.95}$$

$$m_1\mathbf{v}_{10} + m_2\mathbf{v}_{20} = m_1\mathbf{v}_1 + m_2\mathbf{v}_2 \tag{3.96}$$

Taking into account that $(a^2 - b^2) = (a - b)(a + b)$, we can 
write Eq. (3.95) in the form 

$$m_1(\mathbf{v}_{10} - \mathbf{v}_1)(\mathbf{v}_{10} + \mathbf{v}_1) = m_2(\mathbf{v}_2 - \mathbf{v}_{20})(\mathbf{v}_2 + \mathbf{v}_{20}) \tag{3.97}$$

Relation (3.96) can be transformed as follows: 

$$m_1(\mathbf{v}_{10} - \mathbf{v}_1) = m_2(\mathbf{v}_2 - \mathbf{v}_{20}) \tag{3.98}$$

We can state, from considerations of symmetry, that the veloci- 
ties of the spheres after the collision will be directed along the same 
straight line that was the path of the centres of the spheres before 
colliding. Consequently, all the vectors in Eqs. (3.97) and (3.98) are 
collinear. For the collinear vectors $\mathbf{a}$, $\mathbf{b}$, $\mathbf{c}$, 
it follows from the equation $\mathbf{a}\mathbf{b} = \mathbf{a}\mathbf{c}$ 
that $\mathbf{b} = \mathbf{c}$. Therefore, comparing Eqs. (3.97) and (3.98), 
we can conclude that 

$$\mathbf{v}_{10} + \mathbf{v}_1 = \mathbf{v}_2 + \mathbf{v}_{20} \tag{3.99}$$

Multiplying Eq. (3.99) by $m_2$ and subtracting the result from 
Eq. (3.98), then multiplying Eq. (3.99) by $m_1$ and adding the result 
to Eq. (3.98), we get the velocities of the spheres after the collision: 

$$\mathbf{v}_1 = \frac{2m_2\mathbf{v}_{20} + (m_1 - m_2)\,\mathbf{v}_{10}}{m_1 + m_2}, \quad \mathbf{v}_2 = \frac{2m_1\mathbf{v}_{10} + (m_2 - m_1)\,\mathbf{v}_{20}}{m_1 + m_2} \tag{3.100}$$

For numerical calculations, the relations (3.100) must be projected 
onto the x-axis along which the spheres are moving (see Fig. 3.16). 
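
A minimal numerical sketch of relations (3.100) in projection onto the x-axis follows. The function name and the sample masses and velocities are assumptions made for illustration; the printed sums let the reader confirm that momentum and kinetic energy are the same before and after the collision.

```python
def elastic_final_velocities(m1, v10, m2, v20):
    """Velocity projections after a perfectly elastic central collision,
    Eq. (3.100) projected onto the x-axis."""
    total = m1 + m2
    v1 = (2 * m2 * v20 + (m1 - m2) * v10) / total
    v2 = (2 * m1 * v10 + (m2 - m1) * v20) / total
    return v1, v2


# Hypothetical data: a light sphere strikes a heavier sphere at rest.
m1, v10 = 1.0, 4.0
m2, v20 = 3.0, 0.0

v1, v2 = elastic_final_velocities(m1, v10, m2, v20)

momentum_before = m1 * v10 + m2 * v20
momentum_after = m1 * v1 + m2 * v2
energy_before = 0.5 * m1 * v10**2 + 0.5 * m2 * v20**2
energy_after = 0.5 * m1 * v1**2 + 0.5 * m2 * v2**2

print("v1, v2 =", v1, v2)
print("momentum:", momentum_before, "->", momentum_after)
print("kinetic energy:", energy_before, "->", energy_after)
```

The negative sign of `v1` in this example means that the lighter sphere rebounds.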

We must note that the velocities of the spheres after a perfectly 
elastic collision cannot be the same. Indeed, equating expressions 
(3.100) for $\mathbf{v}_1$ and $\mathbf{v}_2$ and performing the relevant 
transformations, we get 

$$\mathbf{v}_{10} = \mathbf{v}_{20}$$

Consequently, for the velocities of the spheres to be the same after 
the collision, they must also be the same before it, but in this case 
no collision can take place. Hence, it follows that the condition of 
equality of the velocities of the spheres after the collision is incom- 
patible with the law of conservation of energy. 

Let us consider the case when the masses of the colliding spheres 
are equal: $m_1 = m_2$. It follows from Eqs. (3.100) that in this 
condition 

$$\mathbf{v}_1 = \mathbf{v}_{20}, \quad \mathbf{v}_2 = \mathbf{v}_{10}$$

i.e. the spheres exchange velocities when they collide. Particularly, 
if one of the spheres of the same mass, for instance, the second one, 
is stationary before the collision, then after it it will travel with the 
velocity which the first sphere originally had, while the first sphere 
after the collision will be stationary. 

We can use Eqs. (3.100) to find the velocity of a sphere after an 
elastic collision with a stationary or a moving wall (which we can 
consider as a sphere of infinitely great mass $m_2$ and infinitely great 
radius). Dividing the numerator and denominator of Eqs. (3.100) 
by $m_2$ and disregarding the terms containing the factor $m_1/m_2$, 
we get 

$$\mathbf{v}_1 = 2\mathbf{v}_{20} - \mathbf{v}_{10}, \quad \mathbf{v}_2 = \mathbf{v}_{20}$$

The result obtained shows that the velocity of the wall remains 
unchanged. The velocity of the sphere, however, if the wall is station- 
ary ($\mathbf{v}_{20} = 0$), reverses. If the wall is moving, the magnitude 
of the velocity of the sphere also changes (it grows by $2v_{20}$ if the 
wall moves toward the sphere and diminishes by $2v_{20}$ if the wall moves 
away from the sphere, which is catching up with it). 


3.12. Law of Angular Momentum Conservation 


We already know two additive quantities obeying laws of conser- 
vation: energy and momentum. Now we shall find a third quantity 
of this kind. For this purpose, we shall consider a system consisting 
of two interacting particles on which external forces also act 
(Fig. 3.17). The equations of motion of the particles have the form 

$$m_1\dot{\mathbf{v}}_1 = \mathbf{F}_{12} + \mathbf{F}_1, \quad m_2\dot{\mathbf{v}}_2 = \mathbf{F}_{21} + \mathbf{F}_2$$

Let us find the vector product of the first equation and the position 
vector of the first particle $\mathbf{r}_1$ and of the second equation and 
the position vector of the second particle $\mathbf{r}_2$, placing the 
position vectors at the left: 

$$
\begin{aligned}
m_1[\mathbf{r}_1, \dot{\mathbf{v}}_1] &= [\mathbf{r}_1, \mathbf{F}_{12}] + [\mathbf{r}_1, \mathbf{F}_1] \\
m_2[\mathbf{r}_2, \dot{\mathbf{v}}_2] &= [\mathbf{r}_2, \mathbf{F}_{21}] + [\mathbf{r}_2, \mathbf{F}_2]
\end{aligned}
\tag{3.101}
$$

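A vector product of the kind $[\mathbf{r}, \dot{\mathbf{v}}]$ is equivalent 
to the expression $d[\mathbf{r}, \mathbf{v}]/dt$. Indeed, according to 
Eq. (1.55) 

$$\frac{d}{dt}[\mathbf{r}, \mathbf{v}] = [\dot{\mathbf{r}}, \mathbf{v}] + [\mathbf{r}, \dot{\mathbf{v}}] = [\mathbf{r}, \dot{\mathbf{v}}] \tag{3.102}$$

because $[\dot{\mathbf{r}}, \mathbf{v}] = [\mathbf{v}, \mathbf{v}] = 0$. Making 
such a substitution in Eqs. (3.101) and taking into account that 
$\mathbf{F}_{21} = -\mathbf{F}_{12}$, we get the equations 

$$
\begin{aligned}
m_1\frac{d}{dt}[\mathbf{r}_1, \mathbf{v}_1] &= [\mathbf{r}_1, \mathbf{F}_{12}] + [\mathbf{r}_1, \mathbf{F}_1] \\
m_2\frac{d}{dt}[\mathbf{r}_2, \mathbf{v}_2] &= -[\mathbf{r}_2, \mathbf{F}_{12}] + [\mathbf{r}_2, \mathbf{F}_2]
\end{aligned}
\tag{3.103}
$$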


Mass is a constant scalar quantity. It can therefore be put inside 
the time derivative and into the vector product: 

$$m\frac{d}{dt}[\mathbf{r}, \mathbf{v}] = \frac{d}{dt}[\mathbf{r}, m\mathbf{v}] = \frac{d}{dt}[\mathbf{r}, \mathbf{p}]$$

With this in view, we shall find the sum of Eqs. (3.103). We get 

$$\frac{d}{dt}\{[\mathbf{r}_1, \mathbf{p}_1] + [\mathbf{r}_2, \mathbf{p}_2]\} = [(\mathbf{r}_1 - \mathbf{r}_2), \mathbf{F}_{12}] + [\mathbf{r}_1, \mathbf{F}_1] + [\mathbf{r}_2, \mathbf{F}_2]$$

The vectors $\mathbf{r}_1 - \mathbf{r}_2$ and $\mathbf{F}_{12}$ are collinear. 
Consequently, their vector product equals zero. We thus obtain the 
relation 

$$\frac{d}{dt}\{[\mathbf{r}_1, \mathbf{p}_1] + [\mathbf{r}_2, \mathbf{p}_2]\} = [\mathbf{r}_1, \mathbf{F}_1] + [\mathbf{r}_2, \mathbf{F}_2] \tag{3.104}$$

If the system is closed, the right-hand side of this relation vanishes, 
and, therefore, 

$$[\mathbf{r}_1, \mathbf{p}_1] + [\mathbf{r}_2, \mathbf{p}_2] = \text{const}$$

We have arrived at an additive quantity obeying a law of conser- 
vation that is called the angular momentum (or the moment of 
momentum) relative to point O (see Fig. 3.17). 





For a separate particle, the angular momentum relative to point $O$ 
is defined as the pseudovector 

$$\mathbf{L} = [\mathbf{r}, \mathbf{p}] = [\mathbf{r}, m\mathbf{v}] \tag{3.105}$$

The angular momentum of a system relative to point $O$ is defined as 
the vector sum of the angular momenta of the particles in the system: 

$$\mathbf{L} = \sum_i \mathbf{L}_i = \sum_i [\mathbf{r}_i, \mathbf{p}_i] \tag{3.106}$$

The projection of the vector (3.105) onto the z-axis is called the 
angular momentum of a particle relative to this axis: 

$$L_z = [\mathbf{r}, \mathbf{p}]_{\mathrm{pr},z} \tag{3.107}$$

Similarly, the angular momentum of a system relative to the z-axis 
is defined as the scalar quantity 

$$L_z = \sum_i [\mathbf{r}_i, \mathbf{p}_i]_{\mathrm{pr},z} \tag{3.108}$$

A glance at Fig. 3.18 shows that the magnitude of the angular mo- 
mentum vector of a particle is 

$$L = rp\sin\alpha = lp \tag{3.109}$$

where $l = r\sin\alpha$ is the length of a perpendicular dropped from 
point $O$ onto the straight line along which the momentum of the 
particle is directed. This length is called the arm of the momentum 
relative to point $O$. It is assumed in Fig. 3.18 that point $O$ relative 
to which the angular momentum is taken and the vector $\mathbf{p}$ are in 
the plane of the drawing. The vector $\mathbf{L}$ is at right angles to the 
plane of the drawing and is directed away from us. 

Let us consider two typical examples. 

1. Assume that a particle is moving along the straight line depict- 
ed in Fig. 3.18 by the dashed line. In this case, the angular momentum 
of the particle can change only in magnitude. The magnitude of the 
angular momentum is 

$$L = mvl \tag{3.110}$$

the arm $l$ remaining constant here. 

2. A particle of mass $m$ moves along a circle of radius $R$ (Fig. 3.19). 
The magnitude of the angular momentum of the particle relative 
to the centre of the circle $O$ is 

$$L = mvR \tag{3.111}$$

The vector $\mathbf{L}$ is perpendicular to the plane of the circle. The 
direction of motion of the particle and the vector $\mathbf{L}$ form a 
right-handed system. Since the arm, which equals $R$, remains constant, 
the angular momentum can change only as a result of a change in the 
magnitude of the velocity. Upon uniform motion of the particle along 
the circle, the angular momentum remains constant both in magnitude 
and in direction. 
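
Both examples can be reproduced directly from definition (3.105). The sketch below is an illustration under assumed values of the mass, speed, arm and radius (SI units); the helper `cross` implements the vector product written as $[\mathbf{a}, \mathbf{b}]$ in the text.

```python
import math


def cross(a, b):
    """Vector product [a, b] of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def angular_momentum(m, r, v):
    """L = [r, p] = [r, mv] relative to the origin O, Eq. (3.105)."""
    p = tuple(m * component for component in v)
    return cross(r, p)


# Illustrative values (assumed).
m, speed = 2.0, 3.0

# Example 1: uniform motion along the straight line y = arm, parallel to x.
arm = 1.5
L_line = [angular_momentum(m, (x, arm, 0.0), (speed, 0.0, 0.0))
          for x in (-4.0, 0.0, 10.0)]

# Example 2: uniform counter-clockwise motion along a circle centred on O.
radius = 0.5
L_circle = []
for angle in (0.0, 1.0, 2.5):
    r = (radius * math.cos(angle), radius * math.sin(angle), 0.0)
    v = (-speed * math.sin(angle), speed * math.cos(angle), 0.0)
    L_circle.append(angular_momentum(m, r, v))

print("straight line:", L_line, "expected magnitude m*v*l =", m * speed * arm)
print("circle:", L_circle, "expected magnitude m*v*R =", m * speed * radius)
```

In both cases only the z-component is non-zero, and its magnitude agrees with Eqs. (3.110) and (3.111) at every sampled position.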

The pseudovector 

$$\mathbf{M} = [\mathbf{r}, \mathbf{F}] \tag{3.112}$$

is called the moment of the force $\mathbf{F}$ relative to point $O$ (or 
the torque relative to this point) from which the position vector 
$\mathbf{r}$ is drawn to the point of application of the force (Fig. 3.20). 
Inspection of the figure shows that the magnitude of the moment of 
the force can be written in the form 

$$M = rF\sin\alpha = lF \tag{3.113}$$

where $l = r\sin\alpha$ is the arm of the force (the moment or lever arm) 
relative to point $O$ (i.e. the length of a perpendicular dropped from 
point $O$ onto the straight line along which the force acts). 

The projection of the vector $\mathbf{M}$ onto an axis z passing through 
point $O$ relative to which $\mathbf{M}$ has been determined is called the 
moment of the force (the torque) relative to this axis: 

$$M_z = [\mathbf{r}, \mathbf{F}]_{\mathrm{pr},z} \tag{3.114}$$

Let us resolve the force vector $\mathbf{F}$ (Fig. 3.21) into three mutually 
perpendicular components: $\mathbf{F}_\parallel$ parallel to the z-axis, 
$\mathbf{F}_R$ perpendicular to the z-axis and acting along a line passing 
through the axis, and, finally, $\mathbf{F}_\tau$ perpendicular to the plane 
passing through the axis and the point of application of the force (this 
component is designated in the figure by a circle with a cross in it). 
If we imagine a circle of radius $R$ with its centre on the z-axis, then 
the component $\mathbf{F}_\tau$ will be directed along a tangent to this 
circle. The moment of the force $\mathbf{F}$ relative to point $O$ equals 
the sum of the moments of the components: 
$\mathbf{M} = \mathbf{M}_\parallel + \mathbf{M}_R + \mathbf{M}_\tau$. The 
vectors $\mathbf{M}_\parallel$ and $\mathbf{M}_R$ are perpendicular to the 
z-axis, therefore their projections onto this axis equal zero. The moment 
$\mathbf{M}_\tau$ has a magnitude equal to $rF_\tau$ and makes the angle 
$\alpha$ with the z-axis. The cosine of $\alpha$ is $R/r$. Hence, the moment 
of the component $\mathbf{F}_\tau$ relative to the z-axis has the magnitude 
$M_\tau\cos\alpha = RF_\tau$. The moment of the force $\mathbf{F}$ relative 
to the z-axis thus equals 

$$M_z = RF_\tau \tag{3.115}$$

Up to now, we understood $F_\tau$ to stand for the magnitude of the com- 
ponent $\mathbf{F}_\tau$. But $F_\tau$ can also be considered as the 
projection of the vector $\mathbf{F}$ onto the unit vector 
$\boldsymbol{\tau}$ that is tangent to a circle of radius $R$ and is 
directed so that motion along the circle in the direction of 
$\boldsymbol{\tau}$ forms a right-handed system with the direction of the 
z-axis. With such an interpretation of $F_\tau$, Eq. (3.115) will also 
determine the sign of $M_z$. 

The moment $\mathbf{M}$ of a force characterizes the ability of the force 
to rotate a body about the point relative to which it is taken. We must 
note that when a body can rotate arbitrarily relative to point $O$, 
the force will cause it to rotate about an axis that is perpendicular 
to the plane containing the force and point $O$, i.e. about an axis 
coinciding with the direction of the moment of the force relative to 
the given point. 

The moment of a force relative to the z-axis characterizes the abil- 
ity of the force to rotate a body about this axis. The components 
$\mathbf{F}_\parallel$ and $\mathbf{F}_R$ cannot cause rotation about the 
z-axis. Such rotation can be produced only by the component 
$\mathbf{F}_\tau$, and it is the more effective, the greater the moment 
arm $R$. 

Two equal, parallel and oppositely directed forces are called a 
force couple (Fig. 3.22). The distance $l$ between the lines along 
which the forces act is called the arm of the couple. The total moment 
of the forces $\mathbf{F}_1$ and $\mathbf{F}_2$ forming the couple is 

$$\mathbf{M} = [\mathbf{r}_1, \mathbf{F}_1] + [\mathbf{r}_2, \mathbf{F}_2]$$

Since $\mathbf{F}_1 = -\mathbf{F}_2$, we can write 

$$\mathbf{M} = -[\mathbf{r}_1, \mathbf{F}_2] + [\mathbf{r}_2, \mathbf{F}_2] = [(\mathbf{r}_2 - \mathbf{r}_1), \mathbf{F}_2] = [\mathbf{r}_{12}, \mathbf{F}_2] \tag{3.116}$$

where $\mathbf{r}_{12} = \mathbf{r}_2 - \mathbf{r}_1$ is the vector drawn 
from the point of application of the force $\mathbf{F}_1$ to the point of 
application of $\mathbf{F}_2$. Equation (3.116) does not depend on the 
choice of point $O$. Consequently, the moment of a force couple relative 
to any point will be the same. The vector of the moment of a force couple 
is perpendicular to the plane containing the forces (see Fig. 3.22) and 
numerically equals the product of the magnitude of any of the forces and 
the arm. 

Fig. 3.23 
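
The independence of the moment of a couple from the choice of point $O$ can be confirmed numerically. The following sketch uses assumed points of application and an assumed force (SI units); the two reference points are arbitrary.

```python
def cross(a, b):
    """Vector product [a, b] of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def subtract(a, b):
    return tuple(x - y for x, y in zip(a, b))


def couple_moment(r1, r2, force2, origin):
    """Total moment about `origin` of the couple F1 = -F2 applied at r1, r2."""
    force1 = tuple(-component for component in force2)
    moment1 = cross(subtract(r1, origin), force1)
    moment2 = cross(subtract(r2, origin), force2)
    return tuple(a + b for a, b in zip(moment1, moment2))


# Hypothetical couple: forces of magnitude 3 along -y and +y, arm l = 2.
r1, r2 = (0.0, 0.0, 0.0), (2.0, 0.0, 0.0)
force2 = (0.0, 3.0, 0.0)

moment_about_o = couple_moment(r1, r2, force2, origin=(0.0, 0.0, 0.0))
moment_about_other = couple_moment(r1, r2, force2, origin=(5.0, -7.0, 2.0))

print(moment_about_o, moment_about_other)
```

Both results have the magnitude $Fl = 3 \times 2 = 6$ and are perpendicular to the plane containing the forces.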

Forces of interaction between particles act in opposite directions 
along the same straight line (Fig. 3.23). Their moments relative to 
an arbitrary point $O$ are equal in magnitude and opposite in direction. 
Therefore, the moments of the internal forces balance one another 
in pairs, and the sum of the moments of all the internal forces for 
any system of particles, particularly for a solid body, always equals 
zero: 

$$\sum \mathbf{M}_{\text{int}} = 0 \tag{3.117}$$

In accordance with definitions (3.106) and (3.112), we can write 
Eq. (3.104) as follows: 

$$\frac{d}{dt}\mathbf{L} = \sum \mathbf{M}_{\text{ext}} \tag{3.118}$$

This equation is similar to Eq. (3.88). A comparison of these equa- 
tions shows that just as the time derivative of the momentum of 
a system equals the sum of the external forces, so the time 
derivative of the angular momentum equals the sum of the moments 
of the external forces. 

It follows from Eq. (3.118) that in the absence of external forces 
$d\mathbf{L}/dt = 0$. Hence, $\mathbf{L}$ is constant for a closed system. This statement 
is the content of the law of angular momentum conservation, which 
is formulated as follows: the angular momentum of a closed system of 
point particles remains constant. 

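We have proved Eq. (3.118) for a system of two particles. It can 
be generalized quite simply, however, for any number of particles. 
Let us write the equations of motion of the particles: 

$$
\begin{aligned}
m_1\dot{\mathbf{v}}_1 &= \sum_k \mathbf{F}_{1k} + \mathbf{F}_1 \\
&\;\;\vdots \\
m_i\dot{\mathbf{v}}_i &= \sum_k \mathbf{F}_{ik} + \mathbf{F}_i \\
&\;\;\vdots \\
m_N\dot{\mathbf{v}}_N &= \sum_k \mathbf{F}_{Nk} + \mathbf{F}_N
\end{aligned}
$$

Multiplying each of the equations vectorially by the corresponding 
position vector, we get [see Eq. (3.102)]: 

$$
\begin{aligned}
\frac{d}{dt}[\mathbf{r}_1, \mathbf{p}_1] &= \sum_k [\mathbf{r}_1, \mathbf{F}_{1k}] + [\mathbf{r}_1, \mathbf{F}_1] \\
&\;\;\vdots \\
\frac{d}{dt}[\mathbf{r}_i, \mathbf{p}_i] &= \sum_k [\mathbf{r}_i, \mathbf{F}_{ik}] + [\mathbf{r}_i, \mathbf{F}_i] \\
&\;\;\vdots \\
\frac{d}{dt}[\mathbf{r}_N, \mathbf{p}_N] &= \sum_k [\mathbf{r}_N, \mathbf{F}_{Nk}] + [\mathbf{r}_N, \mathbf{F}_N]
\end{aligned}
$$

Let us add up all the $N$ equations: 

$$\frac{d}{dt}\sum_i \mathbf{L}_i = \sum_{\substack{i,k \\ (i \ne k)}} [\mathbf{r}_i, \mathbf{F}_{ik}] + \sum_i [\mathbf{r}_i, \mathbf{F}_i]$$

The first sum in the right-hand side is the sum of the moments of all 
the internal forces, which, as we have shown, equals zero [see 
Eq. (3.117)]. The second sum in the right-hand side is the sum of the 
moments of the external forces. Consequently, we have arrived at 
Eq. (3.118). 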

We must note that the angular momentum also remains constant 
for an unclosed system provided that the total moment of the exter- 
nal forces equals zero [see Eq. (3.118)]. 

Projection of all the quantities in Eq. (3.118) onto a certain direc- 
tion z yields 

$$\frac{d}{dt}L_z = \sum M_{z,\text{ext}} \tag{3.119}$$

according to which the time derivative of the angular momentum 
of the system relative to the z-axis equals the sum of the moments of 
the external forces relative to this axis. 

It follows from Eq. (3.119) that when the sum of the moments of 
the external forces relative to an axis equals zero, the angular mo- 
mentum of the system relative to this axis remains constant. 


3.13. Motion in a Central Force Field 

Let us consider a particle in a central force field. We remind our 
reader that the direction of the force acting on a particle at any point 
of such a field passes through point $O$, the centre of the field, while 
the magnitude of the force depends only on the distance from this 
centre. It is easy to see that the dependence of the force $\mathbf{F}$ 
on $\mathbf{r}$ has the form 

$$\mathbf{F} = f(r)\,\mathbf{e}_r \tag{3.120}$$

where $\mathbf{e}_r$ is the unit vector of the position vector (Fig. 3.24), 
and $f(r)$ is the projection of the force vector onto the direction of the 
position vector, i.e. $F_r$. The function $f(r)$ is positive for a 
repulsive force, and negative for an attractive one. Figure 3.24 has been 
drawn for the case of repulsion of a particle from the force centre. 
Equation (3.120) naturally holds only if the origin of coordinates (i.e. 
the point from which the position vectors are drawn) is at the centre of 
the field. 

Fig. 3.24 

The moment of the force (3.120) relative to point $O$ obviously equals 
zero. This follows from the fact that the moment arm equals zero. 
Hence, in accordance with Eq. (3.118), we see that the angular mo- 
mentum of a particle moving in a central force field remains constant. 
The vector $\mathbf{L} = [\mathbf{r}, \mathbf{p}]$ at each moment of time is 
perpendicular to the plane formed by the vectors $\mathbf{r}$ and 
$\mathbf{p}$ (Fig. 3.25). If $\mathbf{L} = \text{const}$, this plane will be 
fixed. Thus, when a particle moves in a central force field, its position 
vector always remains in one plane. The vector $\mathbf{p}$ is also 
permanently in the same plane. Consequently, the trajectory of the 
particle is a plane curve. The plane containing the trajectory passes 
through the centre of the field (see Fig. 3.25). 

Figure 3.26 shows a portion of the trajectory of the particle (the 
vector $\mathbf{L}$ is directed beyond the drawing). During the time $dt$, 
the position vector of the particle describes the hatched area $dS$. This 
area equals half the area of the parallelogram constructed on the 
vectors $\mathbf{r}$ and $\mathbf{v}\,dt$. The area of the parallelogram, in 
turn, equals the magnitude of the vector product $[\mathbf{r}, \mathbf{v}\,dt]$ 
[see the text following Eq. (1.28)]. Thus, the area of the hatched 
triangle is 

$$dS = \frac{1}{2}\,|[\mathbf{r}, \mathbf{v}]|\,dt = \frac{1}{2m}\,|[\mathbf{r}, \mathbf{p}]|\,dt = \frac{L}{2m}\,dt$$

(we have put the scalar multiplier $dt$ outside the symbol of the vector 
product). Dividing both sides of the equation obtained by $dt$, we 
find that 

$$\frac{dS}{dt} = \frac{L}{2m} \tag{3.121}$$

The quantity $dS/dt$, i.e. the area described by the position vector 
of a particle in unit time, is called the sector velocity. In a central 
force field, $L = \text{const}$, hence the sector velocity of a particle also 
remains constant. 
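
The constancy of the sector velocity can be observed in a simple numerical experiment. The sketch below is an assumption-laden illustration, not a method taken from the text: it integrates planar motion in an attractive inverse-square field, $f(r) = \alpha/r^2$ with $\alpha < 0$, using the velocity Verlet scheme, and the mass, initial conditions and time step are arbitrary choices.

```python
import math


def acceleration(x, y, alpha, m):
    """a = f(r) e_r / m for the central force f(r) = alpha / r**2."""
    r = math.hypot(x, y)
    factor = alpha / (m * r**3)
    return factor * x, factor * y


def sector_velocities(m, alpha, x, y, vx, vy, dt, steps):
    """Integrate the motion and return dS/dt = |L_z| / (2m) at every step."""
    rates = []
    ax, ay = acceleration(x, y, alpha, m)
    for _ in range(steps):
        angular_momentum_z = m * (x * vy - y * vx)
        rates.append(abs(angular_momentum_z) / (2 * m))   # Eq. (3.121)
        # Velocity Verlet: half kick, drift, new force, half kick.
        vx += 0.5 * dt * ax
        vy += 0.5 * dt * ay
        x += dt * vx
        y += dt * vy
        ax, ay = acceleration(x, y, alpha, m)
        vx += 0.5 * dt * ax
        vy += 0.5 * dt * ay
    return rates


# Hypothetical bound orbit: unit mass, attraction with alpha = -1.
rates = sector_velocities(m=1.0, alpha=-1.0, x=1.0, y=0.0,
                          vx=0.0, vy=0.8, dt=1e-3, steps=5000)

print("min dS/dt:", min(rates), "max dS/dt:", max(rates))
```

Although the distance from the centre and the speed both vary along the orbit, the printed minimum and maximum of $dS/dt$ agree to within rounding error.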

Let us find an expression for the angular momentum of a particle 
in the polar coordinates $r$ and $\varphi$ (Fig. 3.27). According to 
Eqs. (1.68)-(1.71), the vector velocity of the particle can be 
represented in the form 

$$\mathbf{v} = \mathbf{v}_r + \mathbf{v}_\varphi = \dot{r}\,\mathbf{e}_r + r\dot{\varphi}\,\mathbf{e}_\varphi \tag{3.122}$$

Using this expression in the equation for $\mathbf{L}$, we get 

$$\mathbf{L} = m[\mathbf{r}, \mathbf{v}] = m[\mathbf{r}, \mathbf{v}_r] + m[\mathbf{r}, \mathbf{v}_\varphi]$$

The vectors $\mathbf{r}$ and $\mathbf{v}_r$ are collinear, therefore the 
first addend equals zero. Consequently 

$$\mathbf{L} = m[\mathbf{r}, \mathbf{v}_\varphi] = m[\mathbf{r}, r\dot{\varphi}\,\mathbf{e}_\varphi] = mr\dot{\varphi}\,[\mathbf{r}, \mathbf{e}_\varphi]$$

The vector product $[\mathbf{r}, \mathbf{e}_\varphi]$ equals $r\mathbf{e}_z$, 
where $\mathbf{e}_z$ is the unit vector of the z-axis (in Fig. 3.27 this unit 
vector is directed toward us). Thus, 

$$\mathbf{L} = mr^2\dot{\varphi}\,\mathbf{e}_z \tag{3.123}$$

Hence we conclude that 

$$L_z = mr^2\dot{\varphi} \tag{3.124}$$

where $L_z$ is the projection of the angular momentum onto the z-axis. 
The magnitude of the angular momentum equals the absolute value of 
expression (3.124). 

Fig. 3.26   Fig. 3.27 

Let us now turn to the energy of a particle. Central forces are con- 
servative (see Sec. 3.4). According to Eq. (3.30), the work of a con- 
servative force equals the decrement of the potential energy of the 
particle $E_p$. Hence, for the force (3.120), the relation 
$dA = -dE_p$ holds, i.e. 

$$dE_p = -dA = -f(r)\,\mathbf{e}_r \cdot d\mathbf{r} = -f(r)\,dr$$

Integration of this expression yields 

$$E_p = -\int f(r)\,dr \tag{3.125}$$

It follows from Eq. (3.125) that the potential energy of a particle 
in a field of central forces depends only on the distance $r$ from the 
centre: $E_p = E_p(r)$. 

Of special interest are forces inversely proportional to the square 
of the distance from the force centre. The function $f(r)$ in Eq. (3.120) 
has the following form for them: 

$$f(r) = \frac{\alpha}{r^2} \tag{3.126}$$

where $\alpha$ is a constant quantity ($\alpha > 0$ corresponds to repulsion 
from the centre, and $\alpha < 0$ to attraction to the centre). Among such 
forces are gravitational and Coulomb ones. 

Introduction of the function (3.126) into Eq. (3.125) yields 

$$E_p = -\int \frac{\alpha}{r^2}\,dr = \frac{\alpha}{r} + C$$

where $C$ is an integration constant. The potential energy is conven- 
tionally considered to vanish at infinity (i.e. at $r = \infty$). In this 
condition, $C = 0$, and 

$$E_p = \frac{\alpha}{r} \tag{3.127}$$

Thus, the total mechanical energy of a particle moving in a central 
field of forces that are inversely proportional to the square of the 
distance from the centre is determined by the expression 

$$E = \frac{mv^2}{2} + \frac{\alpha}{r} \tag{3.128}$$

Substituting the sum of the squares of the velocities $v_r$ and 
$v_\varphi$ for the square of the velocity $v$ in accordance with 
Eq. (3.122), i.e. substituting the expression 
$\dot{r}^2 + r^2\dot{\varphi}^2$ for $v^2$, we obtain 

$$E = \frac{m\dot{r}^2}{2} + \frac{mr^2\dot{\varphi}^2}{2} + \frac{\alpha}{r} \tag{3.129}$$

The energy and the angular momentum of a particle are conserved 
in a central field. Consequently, the left-hand sides of Eqs. (3.124) 
and (3.129) are constants. We have thus arrived at a system of two 
differential equations: 

$$
\begin{aligned}
mr^2\dot{\varphi} &= L_z = \text{const} \\
m\dot{r}^2 + mr^2\dot{\varphi}^2 + \frac{2\alpha}{r} &= 2E = \text{const}
\end{aligned}
\tag{3.130}
$$

After integrating these equations, we can find $r$ and $\varphi$ as 
functions of $t$, i.e. the trajectory and the nature of motion of the 
particle. It must be noted that Eqs. (3.130) contain only the first time 
derivatives of $r$ and $\varphi$. They are, therefore, much easier to solve 
than equations following from Newton’s laws, which contain the second 
derivatives of the coordinates. 

Solution of the system of equations (3.130) is beyond the scope of 
this book. We shall limit ourselves to giving the final result. The 
trajectory of the particle is a conic section, i.e. an ellipse, or a 
parabola, or a hyperbola. Which of these curves is observed in a given 
concrete case depends on the sign of the constant $\alpha$ and the 
magnitude of the total energy of the particle. 

For repulsion (i.e. when $\alpha > 0$), the trajectory of the particle can 
only be a hyperbola (Fig. 3.28). If $L_z = 0$, the hyperbola degenerates 
into a straight line whose continuation passes through the force centre. 
We must note that when $\alpha > 0$, the total energy (3.128) cannot be 
negative. 

For attraction (i.e. when $\alpha < 0$), the total energy may be either 
positive or negative; in particular, it may equal zero. When $E > 0$, 
the trajectory is a hyperbola (Fig. 3.29). When $E = 0$, the trajectory 
will be a parabola. This case takes place if a particle begins its mo- 
tion from a state of rest at infinity [see Eq. (3.128)]. Finally, when 
$E < 0$, the trajectory will be an ellipse. At values of the energy and 
the angular momentum complying with the condition that 
$E = -m\alpha^2/2L^2$, the ellipse degenerates into a circle. 

Motion along an ellipse is finite, and that along a parabola or hy- 
perbola is infinite (see Sec. 3.9). 
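
The classification just stated can be written down as a short routine. This is a sketch under stated assumptions: the function names are hypothetical, the degenerate straight-line case ($L_z = 0$) is not handled, and the sample values describe a unit mass moving at unit speed perpendicular to the position vector at unit distance from an attracting centre with $\alpha = -1$.

```python
def total_energy(m, v, alpha, r):
    """E = m v^2 / 2 + alpha / r, Eq. (3.128)."""
    return 0.5 * m * v**2 + alpha / r


def trajectory_type(alpha, energy):
    """Kind of conic section for the force f(r) = alpha / r**2."""
    if alpha > 0:                       # repulsion
        if energy < 0:
            raise ValueError("for repulsion the total energy cannot be negative")
        return "hyperbola"
    if alpha < 0:                       # attraction
        if energy > 0:
            return "hyperbola"
        if energy == 0:
            return "parabola"
        return "ellipse"
    raise ValueError("alpha must be non-zero")


def is_circle(m, alpha, angular_momentum, energy, tol=1e-12):
    """True when E = -m alpha^2 / (2 L^2), i.e. the ellipse is a circle."""
    return alpha < 0 and abs(
        energy + m * alpha**2 / (2 * angular_momentum**2)) <= tol


# Hypothetical state: m = 1, alpha = -1, r = 1, v = 1 perpendicular to r.
m, alpha, r, v = 1.0, -1.0, 1.0, 1.0
energy = total_energy(m, v, alpha, r)
angular_momentum = m * v * r            # arm equals r for v perpendicular to r

print(energy, trajectory_type(alpha, energy),
      is_circle(m, alpha, angular_momentum, energy))
```

For these values the energy is negative and satisfies the circle condition, so the ellipse degenerates into a circle of unit radius.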


3.14. Two-Body Problem 

A two-body problem is the name given to a problem on the motion 
of two interacting particles. The system formed by the particles is 
assumed to be closed. We learned in Sec. 3.10 that the centre of mass 
of a closed system is either at rest or moves uniformly in a straight 
line. We shall solve the problem in a centre-of-mass frame (a c.m. 
frame), placing the origin of coordinates at point $C$. In this case, 
$\mathbf{r}_C = (m_1\mathbf{r}_1 + m_2\mathbf{r}_2)/(m_1 + m_2) = 0$, i.e. 

$$m_1\mathbf{r}_1 = -m_2\mathbf{r}_2 \tag{3.131}$$

(Fig. 3.30a). Let us introduce the vector 

$$\mathbf{r} = \mathbf{r}_2 - \mathbf{r}_1 \tag{3.132}$$

determining the whereabouts of the second particle relative to the 
first one (Fig. 3.30b). By simultaneously solving Eqs. (3.131) and 
(3.132), it is easy to find that 

$$\mathbf{r}_1 = -\frac{m_2}{m_1 + m_2}\,\mathbf{r}, \quad \mathbf{r}_2 = \frac{m_1}{m_1 + m_2}\,\mathbf{r} \tag{3.133}$$


Similarly to Eq. (3.59), we can write that 
$\mathbf{F}_{12} = -\mathbf{F}_{21} = f(r)\,\mathbf{e}_r$, where $f(r)$ is a 
function of the distance between the particles. It is positive for forces 
of attraction (Fig. 3.30c) and negative for forces of repulsion. 

Fig. 3.30 

Let us write the equations of motion of our particles: 

$$m_1\ddot{\mathbf{r}}_1 = f(r)\,\mathbf{e}_r, \quad m_2\ddot{\mathbf{r}}_2 = -f(r)\,\mathbf{e}_r$$

Division of the first equation by $m_1$, of the second one by $m_2$, and 
subtraction of the first equation from the second yield 

$$\ddot{\mathbf{r}}_2 - \ddot{\mathbf{r}}_1 = -\left(\frac{1}{m_1} + \frac{1}{m_2}\right) f(r)\,\mathbf{e}_r$$

According to Eq. (3.132), the left-hand side is $\ddot{\mathbf{r}}$. Hence, 

$$\ddot{\mathbf{r}} = -\left(\frac{1}{m_1} + \frac{1}{m_2}\right) f(r)\,\mathbf{e}_r \tag{3.134}$$

Equation (3.134) can formally be considered as the equation of 
motion of an imaginary particle in a central force field. The position 
of the particle relative to the force centre is determined by the po- 
sition vector $\mathbf{r}$. According to Eq. (3.134), the mass $\mu$ 
determined by the condition that 

$$\frac{1}{\mu} = \frac{1}{m_1} + \frac{1}{m_2} \tag{3.135}$$

must be ascribed to our imaginary particle. Hence, 

$$\mu = \frac{m_1 m_2}{m_1 + m_2} \tag{3.136}$$

The quantity (3.136) is called the reduced mass of the particles. 

A two-body problem thus reduces to a problem on the motion of 
a single particle in a central force field. Finding $\mathbf{r}$ as a 
function of $t$ from Eq. (3.134), we can use Eqs. (3.133) to determine 
$\mathbf{r}_1(t)$ and $\mathbf{r}_2(t)$. The vectors $\mathbf{r}_1$ and 
$\mathbf{r}_2$ are laid off from the centre of mass $C$ of the system. 
Therefore, to be able to use Eqs. (3.133), we must also lay off the 
position vector $\mathbf{r}$ of the imaginary particle from point $C$ 
[for real particles the vector (3.132) is drawn from the first particle 
to the second one]. 

It can be seen from Eqs. (3.133) and Fig. 3.30 that both particles 
move relative to the centre of mass along geometrically similar tra- 
jectories*. The straight line joining the particles constantly passes 
through the centre of mass. 


* When the force of interaction is inversely proportional to the square of 
the distance between the particles, these trajectories are ellipses, or parabolas, 
or hyperbolas (see Sec. 3.13). 
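
As a minimal numerical sketch of Eqs. (3.133) and (3.136), the snippet below computes the reduced mass and recovers the positions of both particles relative to the centre of mass from the relative vector r. The function names and the sample masses are illustrative assumptions and do not come from the text.

```python
def reduced_mass(m1, m2):
    """Reduced mass of two particles, Eq. (3.136)."""
    return m1 * m2 / (m1 + m2)


def positions_from_relative(m1, m2, r):
    """Return (r1, r2) measured from the centre of mass, Eqs. (3.133).

    r is the vector r2 - r1 given as a tuple of components.
    """
    total = m1 + m2
    r1 = tuple(-m2 / total * c for c in r)
    r2 = tuple(m1 / total * c for c in r)
    return r1, r2


# Hypothetical sample values chosen only for illustration.
m1, m2 = 1.0, 3.0
r = (4.0, 0.0, 0.0)

mu = reduced_mass(m1, m2)
r1, r2 = positions_from_relative(m1, m2, r)
print("reduced mass:", mu)
print("r1:", r1, "r2:", r2)
```

With these values the heavier particle stays closer to the centre of mass, and the products m1*r1 and m2*r2 cancel, as Eq. (3.131) requires.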



CHAPTER 4 NON-INERTIAL REFERENCE FRAMES


4.1. Forces of Inertia 

Newton’s laws are obeyed only in inertial reference frames. A given
body travels with the same acceleration $\mathbf{a}$ relative to all
inertial frames. Any non-inertial reference frame travels with a certain
acceleration relative to inertial frames; therefore the acceleration
$\mathbf{a}'$ of a body in a non-inertial reference frame will differ
from $\mathbf{a}$. Let us use the symbol $\mathbf{a}_0$ to denote the
difference between the accelerations of a body in an inertial and a
non-inertial reference frame:

\[ \mathbf{a} - \mathbf{a}' = \mathbf{a}_0 \tag{4.1} \]

For a non-inertial frame in translational motion, $\mathbf{a}_0$ is the
same for all points of space ($\mathbf{a}_0 = \text{const}$) and is the
acceleration of the non-inertial reference frame. For a rotating
non-inertial frame, $\mathbf{a}_0$ will be different at different points
of space [$\mathbf{a}_0 = \mathbf{a}_0(\mathbf{r}')$, where $\mathbf{r}'$
is the position vector determining the position of a point relative to
the non-inertial reference frame].

Let the resultant of all the forces produced by the action of other
bodies on the given body be $\mathbf{F}$. Then, according to Newton’s
second law, the acceleration of the body relative to any inertial frame
is

\[ \mathbf{a} = \frac{\mathbf{F}}{m} \]

The acceleration of the body relative to a non-inertial frame, in
accordance with Eq. (4.1), can be represented in the form

\[ \mathbf{a}' = \mathbf{a} - \mathbf{a}_0 = \frac{\mathbf{F}}{m} - \mathbf{a}_0 \]

Hence, it follows that even when $\mathbf{F} = 0$, the body will travel
relative to the non-inertial reference frame with the acceleration
$-\mathbf{a}_0$, i.e. as if a force equal to $-m\mathbf{a}_0$ acted on it.

What has been said above signifies that we can use Newton’s equations
in describing motion in non-inertial reference frames, if in addition
to the forces due to the action of bodies on one another, we take
into account the so-called forces of inertia $\mathbf{F}_{in}$. The
latter should be assumed equal to the product of the mass of a body and
the difference between its accelerations relative to the inertial and
non-inertial reference frames taken with the opposite sign:

\[ \mathbf{F}_{in} = -m(\mathbf{a} - \mathbf{a}') = -m\mathbf{a}_0 \tag{4.2} \]

The equation of Newton’s second law for a non-inertial reference
frame will accordingly be

\[ m\mathbf{a}' = \mathbf{F} + \mathbf{F}_{in} \tag{4.3} \]

We shall explain our statement by the following example. Let
us consider a cart with a bracket secured on it from which a ball is
suspended on a string (Fig. 4.1). As long as the cart is at rest or is
moving without acceleration, the string is vertical, and the force of
gravity $\mathbf{P}$ is balanced by the reaction of the string
$\mathbf{F}_r$. Now let us bring the cart into translational motion with
the acceleration $\mathbf{a}_0$. The string will deviate from a vertical
line through an angle such that the resultant of the forces $\mathbf{P}$
and $\mathbf{F}_r$ imparts an acceleration of $\mathbf{a}_0$ to the ball.
The ball will be at rest relative to a reference frame associated with
the cart, although the resultant of the forces $\mathbf{P}$ and
$\mathbf{F}_r$ differs from zero. The absence of acceleration of the ball
relative to this reference frame can be explained formally by the fact
that in addition to the forces $\mathbf{P}$ and $\mathbf{F}_r$ whose sum
equals $m\mathbf{a}_0$, the force of inertia
$\mathbf{F}_{in} = -m\mathbf{a}_0$ also acts on the ball.
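
The deviation angle in the cart example can be estimated with a short sketch. It assumes the ball hangs at rest relative to the cart, so the vertical component of the string reaction balances mg and the horizontal component supplies m a_0, which gives tan(theta) = a_0/g. This relation is not written out in the text, and the function name and sample acceleration are illustrative assumptions.

```python
import math


def string_angle(a0, g=9.81):
    """Angle (in radians) between the string and the vertical.

    Assumes steady translational acceleration a0 of the cart and a ball
    at rest relative to the cart: tan(theta) = a0 / g.
    """
    return math.atan2(a0, g)


# Hypothetical cart acceleration of 2.0 m/s^2.
angle_deg = math.degrees(string_angle(2.0))
print(f"deviation from the vertical: {angle_deg:.1f} degrees")
```

In the cart frame the same angle follows from balancing P, F_r, and the force of inertia -m a_0.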

The introduction of inertial forces permits us to describe the mo- 
tion of bodies in any (both inertial and non-inertial) reference frames 
using the same equations of motion. 

One must understand distinctly that the forces of inertia may never 
be treated on a par with such forces as elastic, gravitational, and 
friction ones, i.e. with forces produced by the action on a body of 
other bodies. Forces of inertia are due to the properties of the refer- 
ence frame in which mechanical phenomena are being considered. 
In this sense, they can be called fictitious forces. 

The consideration of forces of inertia is not a necessity. Any mo- 
tion, in principle, can always be considered relative to an inertial 
reference frame. In practice, however, it is exactly the motion of 
bodies relative to non-inertial reference frames, for instance, rela- 
tive to the Earth’s surface, that is often of interest to us. The use of 
inertial forces makes it possible to solve the relevant problem directly 
relative to such a reference frame, and this is frequently much simpler 
than consideration of the motion in an inertial frame. 

A feature of inertial forces is that they are proportional to the mass
of a body. Owing to this property, inertial forces are similar to
gravitational ones. Imagine that we are in a closed cab removed from all
external bodies and moving with the acceleration $\mathbf{g}$ in the
direction which we shall call the “top” (Fig. 4.2). All the bodies in the
cab will behave as if they experienced the force of inertia
$-m\mathbf{g}$. In particular, a spring to whose end a body of mass $m$
is fastened will stretch so that the elastic force balances the force of
inertia $-m\mathbf{g}$. The same phenomena will be observed, however,
when the cab is stationary and is near the Earth’s surface. Having no
possibility of “looking out” of the cab, we would not be able to
establish by any experiments conducted in the cab whether the force
$-m\mathbf{g}$ is due to its accelerated motion or to the action of the
Earth’s gravitational field. On these grounds, we speak of the
equivalence of forces of inertia and gravitation. This equivalence
underlies Albert Einstein’s general theory of relativity.

[Fig. 4.2]

4.2. Centrifugal Force of Inertia 


Let us consider a disk rotating about a vertical axis $z'$ perpendicular
to it with the angular velocity $\omega$ (Fig. 4.3). A ball fitted onto
a spoke and connected to the centre of the disk by a spring rotates
together with the disk. The ball occupies a position on the spoke such
that the force $\mathbf{F}_{spr}$ stretching the spring is equal to the
product of the mass of the ball $m$ and its acceleration
$\mathbf{a}_n = -\omega^2\mathbf{R}$ [see Eq. (1.102); $\mathbf{R}$ is a
position vector drawn to the ball from the centre of the disk. Its
magnitude $R$ gives the distance from the centre of the disk to the
ball]:

\[ \mathbf{F}_{spr} = -m\omega^2\mathbf{R} \tag{4.4} \]

[Fig. 4.3]

The ball is at rest relative to the reference frame associated with
the disk. This can be formally explained by the circumstance that
apart from the force (4.4), the ball experiences the force of inertia

\[ \mathbf{F}_{cf} = m\omega^2\mathbf{R} \tag{4.5} \]

directed along a radius from the centre of the disk.

[Fig. 4.4, Fig. 4.5]

The force of inertia (4.5) set up in a reference frame rotating relative
to inertial frames is called the centrifugal force of inertia.
This force acts on a body in a rotating reference frame regardless of
whether the body is at rest in this frame (as we have assumed up to
now) or is moving relative to it with the velocity $\mathbf{v}'$.

If the position of a body in a rotating reference frame is
characterized by the position vector $\mathbf{r}'$, then the centrifugal
force of inertia can be represented in the form of a vector triple
product

\[ \mathbf{F}_{cf} = m[\boldsymbol{\omega}, [\mathbf{r}', \boldsymbol{\omega}]] \tag{4.6} \]

Indeed, the vector $\mathbf{b} = [\mathbf{r}', \boldsymbol{\omega}]$ is
directed at right angles to the vectors $\boldsymbol{\omega}$ and
$\mathbf{F}_{cf}$ “toward us” (Fig. 4.4), and its magnitude is
$\omega r' \sin\alpha = \omega R$. The vector product of the mutually
perpendicular vectors $m\boldsymbol{\omega}$ and $\mathbf{b}$ coincides
in direction with $\mathbf{F}_{cf}$, and its magnitude is
$m\omega b = m\omega^2 R = F_{cf}$.
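
The equivalence of Eqs. (4.5) and (4.6) can be checked numerically. The sketch below evaluates the triple product for a body that does not lie in the plane z' = 0 and compares it with m ω² R, where R is the component of r' perpendicular to the axis. The helper names and the sample numbers are illustrative assumptions.

```python
def cross(a, b):
    """Vector product [a, b] of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def centrifugal_force(m, omega, r_rel):
    """Centrifugal force of inertia m [omega, [r', omega]], Eq. (4.6)."""
    inner = cross(r_rel, omega)   # the vector b = [r', omega]
    outer = cross(omega, inner)   # [omega, b]
    return tuple(m * c for c in outer)


# Hypothetical values: rotation about the z' axis, body off the plane z' = 0.
m = 2.0
omega = (0.0, 0.0, 3.0)
r_rel = (1.0, 2.0, 5.0)

f_cf = centrifugal_force(m, omega, r_rel)

# Eq. (4.5): m * omega^2 * R, with R the part of r' perpendicular to the axis.
expected = (m * 3.0 ** 2 * r_rel[0], m * 3.0 ** 2 * r_rel[1], 0.0)
print(f_cf, expected)
```

The z' component of r' drops out, so the force depends only on the distance from the axis.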

In the accurate solution of problems on the motion of bodies relative
to the Earth’s surface, account must be taken of the centrifugal
force of inertia equal to $m\omega^2 R$, where $m$ is the mass of a body,
$\omega$ is the angular velocity of the Earth in its rotation about its
axis, and $R$ is the distance to the body from the Earth’s axis
(Fig. 4.5). When the height of bodies above the Earth’s surface (their
altitude) is not great, we may assume that $R = R_E \cos\varphi$ ($R_E$
is the Earth’s radius, and $\varphi$ is the latitude of the locality).
The expression for the centrifugal force of inertia thus becomes

\[ F_{cf} = m\omega^2 R_E \cos\varphi \tag{4.7} \]

The acceleration of free fall of bodies $\mathbf{g}$ observed relative
to the Earth is due to the action of the force $\mathbf{F}_g$ with which
a body is attracted by the Earth, and of the force $\mathbf{F}_{cf}$.
The resultant of these forces

\[ \mathbf{P} = \mathbf{F}_g + \mathbf{F}_{cf} \tag{4.8} \]

is the force of gravity equal to $m\mathbf{g}$ [see Eq. (2.38)].

The difference between the force of gravity $\mathbf{P}$ and the force
of attraction to the Earth $\mathbf{F}_g$ is not great because the
centrifugal force of inertia is much smaller than $F_g$. Thus, for a
mass of 1 kg, the maximum value of $F_{cf}$ observed at the equator is

\[ m\omega^2 R_E = 1 \times \left(\frac{2\pi}{86\,400}\right)^2 \times 6.4 \times 10^6 = 0.035\ \text{N} \]

whereas $F_g$ approximately equals 9.8 N, i.e. is almost 300 times
greater.
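
The estimate above can be reproduced with a few lines of code. The sketch assumes a rotation period of 86 400 s and the rounded Earth radius of 6.4 × 10^6 m; the constant and function names are illustrative. Direct evaluation with these inputs gives a value slightly below the rounded 0.035 N quoted in the text.

```python
import math

SOLAR_DAY = 86400.0   # s, assumed rotation period of the Earth
R_EARTH = 6.4e6       # m, rounded Earth radius


def centrifugal_at_latitude(m, lat_deg):
    """Magnitude of the centrifugal force of inertia, Eq. (4.7)."""
    omega = 2.0 * math.pi / SOLAR_DAY
    return m * omega ** 2 * R_EARTH * math.cos(math.radians(lat_deg))


f_eq = centrifugal_at_latitude(1.0, 0.0)
print(f"F_cf at the equator for 1 kg: {f_eq:.4f} N")
print(f"F_g / F_cf is roughly {9.8 / f_eq:.0f}")
```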

The angle $\alpha$ between the directions of $\mathbf{F}_g$ and
$\mathbf{P}$ (see Fig. 4.5) can be found by using the theorem of sines:

\[ \frac{\sin\alpha}{\sin\varphi} = \frac{F_{cf}}{P} = \frac{m\omega^2 R_E \cos\varphi}{mg} \approx \frac{0.035}{9.8}\cos\varphi \approx 0.0035\cos\varphi \]

whence

\[ \sin\alpha \approx 0.0035 \sin\varphi \cos\varphi \approx 0.0018 \sin 2\varphi \]

The sine of a small angle may be approximately replaced by the value
of the angle itself. Such approximation yields

\[ \alpha \approx 0.0018 \sin 2\varphi \tag{4.9} \]

Thus, the angle $\alpha$ varies within the limits from zero (at the
equator, where $\varphi = 0$, and at the poles, where
$\varphi = 90°$) to 0.0018 rad or 6' (at a latitude of 45°).
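
Equation (4.9) is easy to tabulate. The following sketch evaluates the deviation of the plumb line at several latitudes and converts it to minutes of arc; the function name and the choice of latitudes are illustrative assumptions.

```python
import math


def plumb_deviation(lat_deg, coeff=0.0018):
    """Angle alpha (radians) between F_g and P, Eq. (4.9)."""
    return coeff * math.sin(2.0 * math.radians(lat_deg))


for lat in (0, 30, 45, 60, 90):
    alpha = plumb_deviation(lat)
    minutes = math.degrees(alpha) * 60.0
    print(f"latitude {lat:2d} deg: alpha = {alpha:.5f} rad = {minutes:.1f}'")
```

The maximum at 45° comes out close to 6', in agreement with the value stated in the text.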

The direction of the force $\mathbf{P}$ coincides with that of a string
tensioned by a weight, which is called the direction of a plumb line or
the vertical direction. The force $\mathbf{F}_g$ is directed toward the
centre of the Earth. Therefore, a vertical line is directed toward the
centre of the Earth only at the poles and the equator, and deviates at
intermediate latitudes by the angle $\alpha$ determined by expression
(4.9).

The difference $F_g - P$ vanishes at the poles and reaches a maximum
equalling 0.3% of the force $F_g$ at the equator. Owing to the
oblateness of the Earth, the force $F_g$ varies somewhat with the
latitude, being about 0.2% less at the equator than at the poles. As a
result, the acceleration of free fall varies with the latitude within
the limits from 9.780 m/s² at the equator to 9.832 m/s² at the poles.
The value of g = 9.806 65 m/s² is taken as the standard one.

We must note that a freely falling body moves relative to an inertial,
for example, a heliocentric, reference frame with the acceleration
$\mathbf{a} = \mathbf{F}_g/m$ (and not $\mathbf{g}$). A glance at
Fig. 4.5 shows that from the equality of the acceleration $g$ for
different bodies we get the equality of the accelerations $a$. Indeed,
the triangles constructed on the vectors $\mathbf{F}_g$ and $\mathbf{P}$
for different bodies are similar (the angles $\alpha$ and $\varphi$ for
all bodies at the given point on the Earth’s surface are identical).
Consequently, the ratio $F_g/P$, which coincides with the ratio $a/g$,
is the same for all the bodies. Hence, it follows that we get identical
values of $a$ for the same $g$’s.


4.3. Coriolis Force 

When a body moves relative to a rotating reference frame, another 
force called the Coriolis force appears in addition to the centrifugal 
force of inertia. 

The appearance of a Coriolis force can be detected in the following
experiment. Let us take a horizontally arranged disk that can
rotate about a vertical axis. We draw radial line OA on the disk
(Fig. 4.6a). Let us launch a ball with the velocity $\mathbf{v}'$ in the
direction from O to A. If the disk does not rotate, the ball will roll
along the radius we have drawn. If the disk is rotated in the direction
shown by the arrow, however, then the ball will roll along the dashed
curve OB, and its velocity relative to the disk $\mathbf{v}'$ will
change its direction. Consequently, the ball behaves relative to the
rotating reference frame as if it experiences the force $\mathbf{F}_C$
perpendicular to the velocity $\mathbf{v}'$.

To make the ball roll on the rotating disk along the radius, we must
install a guide, for instance, in the form of rib OA (Fig. 4.6b).
When the ball is rolling, the guide rib exerts the force $\mathbf{F}_r$
on it. The ball travels with a velocity constant in direction relative
to the rotating frame (disk). This can formally be explained by the fact
that the force $\mathbf{F}_r$ is balanced by the force of inertia
$\mathbf{F}_C$ applied to the ball at right angles to the velocity
$\mathbf{v}'$. It is exactly the force $\mathbf{F}_C$ that is the
Coriolis force.

Let us first find an expression for the Coriolis force in the particular
case when a particle $m$ moves relative to a rotating reference frame
uniformly along a circle in a plane perpendicular to the axis of
rotation with its centre on this axis (Fig. 4.7). Let $v'$ stand for the
velocity of the particle relative to the rotating frame. The velocity
$v$ of the particle relative to a fixed (inertial) reference frame has
the magnitude $v' + \omega R$ in case (a) and $|v' - \omega R|$ in case
(b), where $\omega$ is the angular velocity of the rotating frame, and
$R$ is the radius of the circle [see Eq. (1.99)].

[Fig. 4.7]

For the particle to move relative to the fixed frame along a circle
with the velocity $v = v' + \omega R$, it must experience the force
$\mathbf{F}$ directed toward the centre of the circle, for example, the
force of tension of the string by means of which the particle is tied to
the centre of the circle (see Fig. 4.7a). The magnitude of this force is

\[ F = \frac{mv^2}{R} = \frac{m(v' + \omega R)^2}{R} = \frac{mv'^2}{R} + 2mv'\omega + m\omega^2 R \tag{4.10} \]

The particle in this case moves relative to the rotating frame with
the acceleration $a'_n = v'^2/R$, i.e. as if it experienced the force

\[ ma'_n = \frac{mv'^2}{R} = F - 2mv'\omega - m\omega^2 R \tag{4.11} \]

[see Eq. (4.10)]. Thus, the particle behaves in the rotating frame as
if two other forces directed away from the centre acted on it in
addition to the force $\mathbf{F}$ directed toward the centre. These two
forces are $\mathbf{F}_{cf} = m\omega^2\mathbf{R}$ and $\mathbf{F}_C$
whose magnitude equals $2mv'\omega$ (Fig. 4.7a). It is easy to see that
the force $\mathbf{F}_C$ can be represented in the form

\[ \mathbf{F}_C = 2m[\mathbf{v}', \boldsymbol{\omega}] \tag{4.12} \]

The force (4.12) is exactly the Coriolis force. This force vanishes when
$v' = 0$. The force $\mathbf{F}_{cf}$ does not depend on $\mathbf{v}'$;
as we have already noted, it acts both on bodies at rest and on moving
ones.

For the case shown in Fig. 4.7b, we have

\[ F = \frac{mv^2}{R} = \frac{m(v' - \omega R)^2}{R} = \frac{mv'^2}{R} - 2mv'\omega + m\omega^2 R \]

Accordingly,

\[ \frac{mv'^2}{R} = F + 2mv'\omega - m\omega^2 R \]

Consequently, in a rotating frame, the particle behaves as if it
experienced two forces $\mathbf{F}$ and $\mathbf{F}_C$ directed toward
the centre of the circle, and also the force
$\mathbf{F}_{cf} = m\omega^2\mathbf{R}$ directed away from the centre
(see Fig. 4.7b). The force $\mathbf{F}_C$ in this case can be
represented in the form of Eq. (4.12).

Now let us pass over to finding an expression for the Coriolis force
when a particle moves arbitrarily relative to a rotating reference
frame. Let us associate the coordinate axes $x'$, $y'$, $z'$ with the
rotating frame, and make the axis $z'$ coincide with the axis of
rotation (Fig. 4.8). The position vector of the particle can therefore
be represented in the form

\[ \mathbf{r}' = x'\mathbf{e}'_x + y'\mathbf{e}'_y + z'\mathbf{e}'_z \tag{4.13} \]

where $\mathbf{e}'_x$, $\mathbf{e}'_y$, and $\mathbf{e}'_z$ are the unit
vectors of the coordinate axes. The unit vectors $\mathbf{e}'_x$ and
$\mathbf{e}'_y$ rotate together with the reference frame with the
angular velocity $\omega$, whereas the unit vector $\mathbf{e}'_z$
remains stationary.

The position of the particle relative to the fixed frame should be
determined with the aid of the position vector $\mathbf{r}$. The symbols
$\mathbf{r}'$ and $\mathbf{r}$, however, signify the same vector drawn
from the origin of coordinates to the particle. An observer “living” in
the rotating reference frame denotes this vector by $\mathbf{r}'$.
According to his observations, the unit vectors $\mathbf{e}'_x$,
$\mathbf{e}'_y$, $\mathbf{e}'_z$ are stationary, therefore when
differentiating Eq. (4.13), he treats these unit vectors as constants.
A stationary observer uses the symbol $\mathbf{r}$. For him, the unit
vectors $\mathbf{e}'_x$ and $\mathbf{e}'_y$ rotate with the angular
velocity $\omega$ (the unit vector $\mathbf{e}'_z$ is stationary).
Therefore, when differentiating the expression (4.13) equal to
$\mathbf{r}$, he must treat $\mathbf{e}'_x$ and $\mathbf{e}'_y$ as
functions of $t$ whose derivatives are

\[ \dot{\mathbf{e}}'_x = \omega\mathbf{e}'_y, \qquad \dot{\mathbf{e}}'_y = -\omega\mathbf{e}'_x \tag{4.14} \]

[see Fig. 4.8 and Eq. (1.56); the unit vector $\mathbf{e}_{\perp x'}$
perpendicular to $\mathbf{e}'_x$ equals $\mathbf{e}'_y$, and the unit
vector $\mathbf{e}_{\perp y'}$ perpendicular to $\mathbf{e}'_y$ equals
$-\mathbf{e}'_x$]. For the second time derivatives of the unit vectors,
we get

\[ \ddot{\mathbf{e}}'_x = \omega\dot{\mathbf{e}}'_y = -\omega^2\mathbf{e}'_x, \qquad \ddot{\mathbf{e}}'_y = -\omega\dot{\mathbf{e}}'_x = -\omega^2\mathbf{e}'_y \tag{4.15} \]

Let us find the velocity of the particle relative to the rotating
reference frame. To do this, we differentiate the position vector (4.13)
with respect to time, considering the unit vectors as constants:

\[ \mathbf{v}' = \dot{\mathbf{r}}' = \dot{x}'\mathbf{e}'_x + \dot{y}'\mathbf{e}'_y + \dot{z}'\mathbf{e}'_z \tag{4.16} \]

If we now differentiate this expression, we get the acceleration of the
particle relative to the rotating reference frame:

\[ \mathbf{a}' = \dot{\mathbf{v}}' = \ddot{x}'\mathbf{e}'_x + \ddot{y}'\mathbf{e}'_y + \ddot{z}'\mathbf{e}'_z \tag{4.17} \]

Now we shall find the velocity of the particle relative to the fixed
reference frame. For this purpose, we shall differentiate the position
vector (4.13) from the standpoint of the stationary observer.
Using the symbol $\mathbf{r}$ instead of $\mathbf{r}'$ (recall that
$\mathbf{r} = \mathbf{r}'$), we get

\[ \mathbf{v} = \dot{\mathbf{r}} = \dot{x}'\mathbf{e}'_x + x'\dot{\mathbf{e}}'_x + \dot{y}'\mathbf{e}'_y + y'\dot{\mathbf{e}}'_y + \dot{z}'\mathbf{e}'_z \tag{4.18} \]

Differentiating this expression with respect to $t$, we find the
acceleration of the particle relative to the fixed frame:

\[ \mathbf{a} = \dot{\mathbf{v}} = \ddot{x}'\mathbf{e}'_x + 2\dot{x}'\dot{\mathbf{e}}'_x + x'\ddot{\mathbf{e}}'_x + \ddot{y}'\mathbf{e}'_y + 2\dot{y}'\dot{\mathbf{e}}'_y + y'\ddot{\mathbf{e}}'_y + \ddot{z}'\mathbf{e}'_z \]

Taking into account Eqs. (4.14), (4.15), and (4.17), we can transform
the above expression into the form:

\[ \mathbf{a} = \mathbf{a}' + 2\omega(\dot{x}'\mathbf{e}'_y - \dot{y}'\mathbf{e}'_x) - \omega^2(x'\mathbf{e}'_x + y'\mathbf{e}'_y) \tag{4.19} \]


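Let us consider the vector product $[\boldsymbol{\omega}, \mathbf{v}']$.
We shall represent it in the form of a determinant [see Eq. (1.33)]:

\[ [\boldsymbol{\omega}, \mathbf{v}'] = \begin{vmatrix} \mathbf{e}'_x & \mathbf{e}'_y & \mathbf{e}'_z \\ \omega_x & \omega_y & \omega_z \\ v'_x & v'_y & v'_z \end{vmatrix} \tag{4.20} \]

According to Eq. (4.16), $v'_x = \dot{x}'$, $v'_y = \dot{y}'$,
$v'_z = \dot{z}'$. In addition, for the direction of the coordinate axes
that we have selected, we have $\omega_x = \omega_y = 0$,
$\omega_z = \omega$. Introduction of these values into Eq. (4.20) yields

\[ [\boldsymbol{\omega}, \mathbf{v}'] = \begin{vmatrix} \mathbf{e}'_x & \mathbf{e}'_y & \mathbf{e}'_z \\ 0 & 0 & \omega \\ \dot{x}' & \dot{y}' & \dot{z}' \end{vmatrix} = -\mathbf{e}'_x\,\omega\dot{y}' + \mathbf{e}'_y\,\omega\dot{x}' \tag{4.21} \]

The result obtained shows that the second term of Eq. (4.19) can be
written in the form $2[\boldsymbol{\omega}, \mathbf{v}']$. The
expression in parentheses in the last term of Eq. (4.19) equals the
component of the position vector $\mathbf{r}'$ perpendicular to the axis
of rotation (to the axis $z'$) [see Eq. (4.13)]. Let us denote this
component by the symbol $\mathbf{R}$ (compare with Fig. 1.33). In view
of everything said above, Eq. (4.19) can be written as follows: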


\[ \mathbf{a} = \mathbf{a}' + 2[\boldsymbol{\omega}, \mathbf{v}'] - \omega^2\mathbf{R} \tag{4.22} \]

It follows from Eq. (4.22) that the acceleration of the particle
relative to the fixed reference frame can be represented in the form
of the sum of three accelerations: that relative to the rotating frame
$\mathbf{a}'$, the acceleration equal to $-\omega^2\mathbf{R}$*, and the
acceleration

\[ \mathbf{a}_C = 2[\boldsymbol{\omega}, \mathbf{v}'] \tag{4.23} \]

called the Coriolis acceleration.

For a particle to move with the acceleration (4.22), bodies must act
on it with the resultant force $\mathbf{F} = m\mathbf{a}$. According to
Eq. (4.22)

\[ m\mathbf{a}' = m\mathbf{a} - 2m[\boldsymbol{\omega}, \mathbf{v}'] + m\omega^2\mathbf{R} = \mathbf{F} + 2m[\mathbf{v}', \boldsymbol{\omega}] + m\omega^2\mathbf{R} \tag{4.24} \]

(transposition of the multipliers changes the sign of the vector
product). The result obtained signifies that when compiling an equation
of Newton’s second law for a rotating reference frame, in addition to
the forces of interaction account must be taken of the centrifugal
force of inertia determined by Eq. (4.5), and also of the Coriolis
force which even in the most general case is determined by Eq. (4.12).
We must note that the Coriolis force is always in a plane perpendicular
to the axis of rotation.

It follows from a comparison of Eqs. (4.18), (4.16), and (4.14)
that

\[ \mathbf{v} = \mathbf{v}' + x'\dot{\mathbf{e}}'_x + y'\dot{\mathbf{e}}'_y = \mathbf{v}' + \omega(x'\mathbf{e}'_y - y'\mathbf{e}'_x) \]

Calculations similar to those which led us to Eq. (4.22) can help us
see that the last term of the above expression equals
$[\boldsymbol{\omega}, \mathbf{r}']$. Hence,

\[ \mathbf{v} = \mathbf{v}' + [\boldsymbol{\omega}, \mathbf{r}'] \tag{4.25} \]

When $\mathbf{v}' = 0$, this equation transforms into Eq. (1.100).
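
The general expression (4.12) for the Coriolis force can be illustrated with a small sketch. It evaluates 2m[v', ω] for a particle whose relative velocity has a component along the axis of rotation and checks that the force lies in the plane perpendicular to the axis and is perpendicular to v'. The helper names and the numbers are illustrative assumptions.

```python
def cross(a, b):
    """Vector product [a, b] of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    """Scalar product of two 3-component tuples."""
    return sum(x * y for x, y in zip(a, b))


def coriolis_force(m, v_rel, omega):
    """Coriolis force 2m [v', omega], Eq. (4.12)."""
    return tuple(2.0 * m * c for c in cross(v_rel, omega))


# Hypothetical values: rotation about z', particle on the x' axis moving
# along +y' (the sense of rotation) with an extra component along z'.
m = 1.5
omega = (0.0, 0.0, 2.0)
v_rel = (0.0, 3.0, 1.0)

f_c = coriolis_force(m, v_rel, omega)
print("Coriolis force:", f_c)
print("component along the axis:", dot(f_c, omega))
print("projection on v':", dot(f_c, v_rel))
```

For this choice the force points along +x', i.e. away from the axis, which corresponds to case (a) of Fig. 4.7; the velocity component along the axis contributes nothing.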

Examples of Motions in Which the Coriolis Force Manifests Itself.
In interpreting phenomena associated with the motion of bodies relative
to the Earth’s surface, it is sometimes necessary to take account
of the influence of Coriolis forces. For example, in the free fall of
bodies, a Coriolis force acts on them that causes them to deviate to
the East from a vertical line (Fig. 4.9). This force is the greatest at
the equator and vanishes at the poles.

A flying projectile also experiences deviations due to Coriolis forces
(Fig. 4.10). When a projectile is fired from a gun facing North, it
will deviate to the East in the northern hemisphere and to the West
in the southern one. If a projectile is fired along a meridian to the
South, the deviations will be the reverse. If a projectile is fired along
the equator, Coriolis forces will press it toward the Earth if the shot
was directed to the West, and lift it if the shot was directed to the
East. We invite our reader to convince himself that the Coriolis
force acting on a body moving along a meridian in any direction (to
the North or South) has a rightward direction relative to that of
motion in the northern hemisphere and a leftward one in the southern
hemisphere. This is why rivers always wash out their right banks in
the northern hemisphere and their left banks in the southern one.
This is also why the rails of a double-track railway wear differently.

* The acceleration $\mathbf{a}_{tr} = -\omega^2\mathbf{R}$ is called
transferable. It is the acceleration which a particle would have if it
were at rest in a moving (in our case, a rotating) reference frame.





The Coriolis forces also manifest themselves in the oscillations
of a pendulum. Figure 4.11 shows the trajectory of a pendulum bob
(it is assumed for simplicity’s sake that the pendulum is at a pole).
At the north pole, the Coriolis force will constantly be directed to the
right of the direction of the pendulum’s motion, and at the south pole
to the left. As a result, the trajectory has the shape of a rosette.

As can be seen from the figure, the plane of oscillations of the
pendulum turns clockwise relative to the Earth, and it completes one
revolution a day. Relative to a heliocentric reference frame, the plane
of oscillations remains unchanged, while the Earth rotates, completing
one revolution a day. It can be shown that at the latitude $\varphi$ the
plane of oscillations of a pendulum turns through the angle of
$2\pi \sin\varphi$ in a day.
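
The dependence on latitude stated above can be tabulated with a brief sketch. The daily turning angle 2π sin φ is taken from the text; the time needed for a full revolution, 1/sin φ days, is an inference from that rate, and the function names are illustrative assumptions.

```python
import math


def daily_turn(lat_deg):
    """Angle (radians) through which the plane of oscillations turns in a day."""
    return 2.0 * math.pi * math.sin(math.radians(lat_deg))


def full_turn_days(lat_deg):
    """Days needed for one complete revolution of the plane (lat_deg != 0)."""
    return 2.0 * math.pi / daily_turn(lat_deg)


for lat in (90, 60, 45, 30):
    turn_deg = math.degrees(daily_turn(lat))
    print(f"latitude {lat} deg: {turn_deg:.0f} deg per day, "
          f"full turn in {full_turn_days(lat):.2f} days")
```

At the pole the plane completes one revolution a day, as described for Fig. 4.11, while at lower latitudes the turning is slower.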

Thus, observations of the rotation of the plane in which a pendulum
oscillates (pendulums intended for this purpose are called Foucault
pendulums) provide a direct proof of the Earth’s rotation about
its axis.

4.4. Laws of Conservation in Non-Inertial Reference Frames

The equations of motion in a non-inertial frame do not differ in 
any way from those of motion in an inertial reference frame when the 
forces of inertia are taken into account. Therefore, all the corollaries 
following from the equations of motion, particularly Eqs. (3.77), 
(3.88), and (3.118), also hold in non-inertial reference frames. 

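Equation (3.77) acquires the following form for a non-inertial
frame:

\[ E_2 - E_1 = A_{12,\,\text{non-cons}} + A_{12,\,\text{in}} \tag{4.26} \]

where $A_{12,\,\text{in}}$ is the work done by the forces of inertia.

Equations (3.88) and (3.118) can be written as follows for a
non-inertial frame:

\[ \frac{d}{dt}\mathbf{p} = \sum \mathbf{F}_{ext} + \sum \mathbf{F}_{in} \tag{4.27} \]

\[ \frac{d}{dt}\mathbf{L} = \sum \mathbf{M}_{ext} + \sum \mathbf{M}_{in} \tag{4.28} \]

Here $\mathbf{F}_{ext}$ = force due to interaction
$\mathbf{F}_{in}$ = force of inertia
$\mathbf{M}_{ext}$ and $\mathbf{M}_{in}$ = moments of the above forces.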

The centrifugal force of inertia $\mathbf{F}_{cf} = m\omega^2\mathbf{R}$
is conservative. Indeed, the work of this force is

\[ A_{12,\,cf} = \int_1^2 \mathbf{F}_{cf}\,d\mathbf{r} = m\omega^2 \int_1^2 \mathbf{R}\,d\mathbf{r} \]

Inspection of Fig. 4.12 shows that the projection of the vector
$d\mathbf{r}$ on the direction of the vector $\mathbf{R}$ equals $dR$,
the increment of the magnitude of $\mathbf{R}$. Consequently,
$\mathbf{R}\,d\mathbf{r} = R\,dR = d(R^2/2)$. Thus,

\[ A_{12,\,cf} = m\omega^2 \int_1^2 d\left(\frac{R^2}{2}\right) = m\omega^2\,\frac{R_2^2 - R_1^2}{2} \tag{4.29} \]

The expression obtained obviously does not depend on the path along
which the displacement from point 1 to point 2 occurred.

The conservative nature of the force $\mathbf{F}_{cf}$ makes it possible
to introduce the potential energy of a particle $E_{p,\,cf}$ (the
centrifugal energy) whose decrement determines the work of the
centrifugal force of inertia:

\[ A_{12,\,cf} = E_{p,\,cf,\,1} - E_{p,\,cf,\,2} \tag{4.30} \]

[see Eq. (3.30)]. A comparison of Eqs. (4.29) and (4.30) shows that
$E_{p,\,cf} = -\tfrac{1}{2} m\omega^2 R^2 + \text{const}$. We may assume
that the constant equals zero. We thus get the following expression for
the centrifugal energy:

\[ E_{p,\,cf} = -\frac{1}{2} m\omega^2 R^2 \tag{4.31} \]

If we add Eq. (4.31) to the potential energy of a particle, then the
work of the centrifugal force of inertia must not be included in the
quantity $A_{12,\,\text{in}}$ in Eq. (4.26).
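
The path independence expressed by Eq. (4.29) can be checked numerically. The sketch below sums F_cf · dr along two different hypothetical paths joining the same end points in the plane perpendicular to the axis and compares both results with the closed-form expression. The path functions, step count, and sample values are illustrative assumptions.

```python
def centrifugal_work(path, m, omega, steps=2000):
    """Approximate the work of F_cf = m*omega^2*R along path(s), 0 <= s <= 1.

    path(s) returns the (x, y) coordinates of the particle in the plane
    perpendicular to the axis of rotation. Each small segment uses the
    force evaluated at the segment midpoint.
    """
    work = 0.0
    x0, y0 = path(0.0)
    for k in range(1, steps + 1):
        x1, y1 = path(k / steps)
        xm, ym = 0.5 * (x0 + x1), 0.5 * (y0 + y1)
        work += m * omega ** 2 * (xm * (x1 - x0) + ym * (y1 - y0))
        x0, y0 = x1, y1
    return work


def straight(s):
    """Straight segment from (1, 0) to (3, 1)."""
    return (1.0 + 2.0 * s, s)


def curved(s):
    """A curved path with the same end points."""
    return (1.0 + 2.0 * s, s ** 3)


m, omega = 2.0, 3.0
r1_sq, r2_sq = 1.0, 10.0                              # R^2 at the end points
closed_form = m * omega ** 2 * (r2_sq - r1_sq) / 2.0  # Eq. (4.29)

w_straight = centrifugal_work(straight, m, omega)
w_curved = centrifugal_work(curved, m, omega)
print(w_straight, w_curved, closed_form)
```

Both sums agree with Eq. (4.29), and the same number equals the decrement of the centrifugal energy (4.31) between the two points.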



CHAPTER 5 MECHANICS OF A RIGID BODY


5.1. Motion of a Body 

In Sec. 1.1, we acquainted ourselves with the two fundamental 
kinds of motion of a rigid body — translation and rotation. 

In translation, all the points of a body receive displacements equal 
in magnitude and direction during the same time interval. Conse- 
quently, the velocities and accelerations of all the points are identi- 
cal at every moment of time. It is therefore sufficient to determine 
the motion of one of the points of a body (for example, of its centre 
of mass) to completely characterize the motion of the entire body. 

In rotation, all the points of a rigid body move along circles whose 
centres are on a single straight line called the axis of rotation. To 
describe rotation, we must set the position of the axis of rotation in 
space and the angular velocity of the body at each moment of time. 

Any motion of a rigid body can be represented as the superposition 
of the two fundamental kinds of motion indicated above. We shall 
show this for plane motion, i.e. motion when all the points of a body 
move in parallel planes. An example of plane motion is the rolling 
of a cylinder along a plane (Fig. 5.1). 

The arbitrary displacement of a rigid body from position 1 to
position 2 (Fig. 5.2) can be represented as the sum of two
displacements: translation from position 1 to position 1′ or 1″, and
rotation about the axis O′ or the axis O″. It is quite obvious that such
a division of a displacement into translation and rotation can be
performed in an infinite multitude of ways, but in any case rotation
occurs through the same angle $\varphi$.

In accordance with the above, the elementary displacement of a
point of a body $d\mathbf{s}$ can be resolved into two displacements,
the “translational” one $d\mathbf{s}_{tr}$ and the “rotational” one
$d\mathbf{s}_{rot}$:

\[ d\mathbf{s} = d\mathbf{s}_{tr} + d\mathbf{s}_{rot} \]

where $d\mathbf{s}_{tr}$ is the same for all the points of the body.
This resolution of the displacement $d\mathbf{s}$, as we have seen, can
be performed in different ways. In each of them, the rotational
displacement $d\mathbf{s}_{rot}$ is performed by rotation of the body
through the same angle $d\varphi$ (but relative to different axes),
whereas $d\mathbf{s}_{tr}$ and $d\mathbf{s}_{rot}$ are different.

Dividing $d\mathbf{s}$ by the corresponding time interval $dt$, we get
the velocity of a point:

\[ \mathbf{v} = \frac{d\mathbf{s}}{dt} = \frac{d\mathbf{s}_{tr}}{dt} + \frac{d\mathbf{s}_{rot}}{dt} = \mathbf{v}_0 + \mathbf{v}' \]

where $\mathbf{v}_0$ = velocity of translation, which is the same for
all the points of a body

$\mathbf{v}'$ = velocity due to rotation, which is different for
different points of the body.

Thus, the plane motion of a rigid body can be represented as the
sum of two motions: translation with the velocity $\mathbf{v}_0$ and
rotation with the angular velocity $\boldsymbol{\omega}$ (the vector
$\boldsymbol{\omega}$ in Fig. 5.1 is directed at right angles to the
plane of the drawing, beyond it). Such a representation of complex
motion can be accomplished in many ways differing in the values of
$\mathbf{v}_0$ and $\mathbf{v}'$, but corresponding to the same
angular velocity $\boldsymbol{\omega}$. For example, the motion of a
cylinder rolling without slipping along a plane (Fig. 5.1) can be
represented either as translation with the velocity $\mathbf{v}_0$ and
simultaneous rotation with the angular velocity $\boldsymbol{\omega}$
about the axis O, or as translation with the velocity
$\mathbf{v}''_0 = 2\mathbf{v}_0$ and rotation with the same angular
velocity $\boldsymbol{\omega}$ about the axis O″, or, finally, as only
rotation, again with the same angular velocity $\boldsymbol{\omega}$
about the axis O′.

Assuming that the reference frame relative to which we are considering
the complex motion of a rigid body is stationary, the motion
of the body can be represented as rotation with the angular velocity
$\boldsymbol{\omega}$ in a reference frame moving translationally with
the velocity $\mathbf{v}_0$ relative to the stationary frame.

The linear velocity $\mathbf{v}'$ of a point with the position vector
$\mathbf{r}$ due to rotation of a rigid body is
$\mathbf{v}' = [\boldsymbol{\omega}, \mathbf{r}]$ [see Eq. (1.100)].
Consequently, the velocity of this point in complex motion can be
represented in the form

\[ \mathbf{v} = \mathbf{v}_0 + [\boldsymbol{\omega}, \mathbf{r}] \tag{5.1} \]
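
Equation (5.1) can be tried out on the rolling cylinder of Fig. 5.1. The sketch assumes the cylinder rolls without slipping along the x axis, with the y axis pointing up and ω directed along −z, so that ω = v_0/R in magnitude. The axis orientation, helper names, and numbers are illustrative assumptions.

```python
def cross(a, b):
    """Vector product [a, b] of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def point_velocity(v0, omega, r):
    """Velocity of a point of the body, v = v0 + [omega, r], Eq. (5.1).

    r is drawn to the point from the axis O passing through the centre.
    """
    w = cross(omega, r)
    return tuple(v0[k] + w[k] for k in range(3))


radius = 0.5                           # m, hypothetical cylinder radius
speed = 2.0                            # m/s, velocity of the axis O
v0 = (speed, 0.0, 0.0)
omega = (0.0, 0.0, -speed / radius)    # rolling without slipping

v_contact = point_velocity(v0, omega, (0.0, -radius, 0.0))
v_top = point_velocity(v0, omega, (0.0, radius, 0.0))
print("contact line:", v_contact)
print("top of the cylinder:", v_top)
```

The points on the line of contact are momentarily at rest, while the topmost points move with the velocity 2v_0, in agreement with the three equivalent descriptions of the motion given above.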

An elementary displacement of a rigid body in plane motion can
always be represented as rotation about an axis called the
instantaneous axis of rotation. This axis may be either inside the body
or outside it. The position of the instantaneous axis of rotation
relative to a fixed reference frame and relative to the body itself,
generally speaking, changes with time. For a rolling cylinder
(Fig. 5.1), the instantaneous axis O′ coincides with the line of contact
of the cylinder with the plane. When the cylinder rolls, the
instantaneous axis moves both along the plane (i.e. relative to a fixed
reference frame) and along the surface of the cylinder.

The velocities of all the points of the body for each moment of time 
can be considered as due to rotation about the corresponding in- 
stantaneous axis. Consequently, plane motion of a rigid body can be 
considered as a number of consecutive elementary rotations about 
instantaneous axes. 

In non-planar motion, an elementary displacement of a body can
be represented as rotation about an instantaneous axis only if the
vectors $\mathbf{v}_0$ and $\boldsymbol{\omega}$ are mutually
perpendicular. If the angle between these vectors differs from
$\pi/2$, the motion of the body at each moment of time will be the
superposition of two motions: rotation about a certain axis, and
translation along this axis.


5.2. Motion of the Centre of Mass of a Body

By dividing a body into elementary masses $m_i$, we can represent
it as a system of point particles whose mutual arrangement remains
unchanged. Any of these elementary masses may be acted upon both
by internal forces due to its interaction with other elementary masses
of the body being considered, and by external forces. For example, if
a body is in the field of the Earth’s gravitational forces, each
elementary mass of the body $m_i$ will experience an external force
equal to $m_i\mathbf{g}$.

Let us write the equation of Newton’s second law for each
elementary mass:

\[ m_i\mathbf{a}_i = \mathbf{f}_i + \mathbf{F}_i \tag{5.2} \]

where $\mathbf{f}_i$ is the resultant of all the internal forces, and
$\mathbf{F}_i$ the resultant of all the external forces applied to the
given elementary mass. Summation of Eqs. (5.2) for all the elementary
masses yields

\[ \sum m_i\mathbf{a}_i = \sum \mathbf{f}_i + \sum \mathbf{F}_i \tag{5.3} \]

The sum of all the internal forces acting in a system, however, equals
zero. Hence, Eq. (5.3) can be simplified as follows:

\[ \sum m_i\mathbf{a}_i = \sum \mathbf{F}_i \tag{5.4} \]

Here the resultant of all the external forces acting on the body is on
the right-hand side.

The sum on the left-hand side of Eq. (5.4) can be replaced with the
product of the mass of the body $m$ and the acceleration of its centre
of mass (centre of inertia) $\mathbf{a}_C$. Indeed, according to Eq. (3.91), we
have

$$ \sum m_i \mathbf{r}_i = m \mathbf{r}_C $$

Differentiating this relation twice with respect to time and taking 
into account that $\ddot{\mathbf{r}}_i = \mathbf{a}_i$ and $\ddot{\mathbf{r}}_C = \mathbf{a}_C$, we can write

$$ \sum m_i \mathbf{a}_i = m \mathbf{a}_C \qquad (5.5) $$

Comparing Eqs. (5.4) and (5.5), we arrive at the equation 

$$ m \mathbf{a}_C = \sum \mathbf{F}_{\text{ext}} \qquad (5.6) $$

which signifies that the centre of mass of a rigid body moves in the same 
way as a point particle of a mass equal to that of the body would move 
under the action of all the forces applied to the body. 

Equation (5.6) permits us to find the motion of the centre of mass 
of a rigid body if we know the mass of the body and the forces acting 
on it. For translation, this equation will determine the acceleration 
not only of the centre of mass, but also of any other point of the 
body. 
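
The cancellation of the internal forces in Eq. (5.6) can be checked numerically. The sketch below is only an illustration under assumed values: two point masses are joined by a spring (our stand-in for an internal force) and placed in a uniform gravitational field. The masses, positions, spring constant, and helper names are hypothetical and do not come from the text.

```python
# Two point masses joined by a spring (an internal force) in uniform gravity.
# All numerical values are illustrative assumptions.

def add(u, v):
    return tuple(a + b for a, b in zip(u, v))

def scale(k, u):
    return tuple(k * a for a in u)

g = (0.0, 0.0, -9.81)                            # free-fall acceleration, m/s^2
masses = [2.0, 3.0]                              # kg
positions = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.5)]   # m
k_spring, rest_length = 40.0, 1.0                # N/m, m

# Internal spring force on particle 0; particle 1 feels the opposite force.
sep = add(positions[1], scale(-1.0, positions[0]))
dist = sum(c * c for c in sep) ** 0.5
f_on_0 = scale(k_spring * (dist - rest_length) / dist, sep)
internal = [f_on_0, scale(-1.0, f_on_0)]
external = [scale(m, g) for m in masses]         # weight of each particle

# Eq. (5.2) for each particle: a_i = (f_i + F_i) / m_i
accels = [scale(1.0 / m, add(f, F))
          for m, f, F in zip(masses, internal, external)]

# Acceleration of the centre of mass: a_C = sum(m_i a_i) / m
m_total = sum(masses)
a_c = (0.0, 0.0, 0.0)
for m, a in zip(masses, accels):
    a_c = add(a_c, scale(m / m_total, a))

print("a_C =", a_c)   # should agree with g: the spring force drops out
```

Each particle has its own acceleration, but the centre of mass accelerates as a single particle of mass m would under the external forces alone.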

5.3. Rotation of a Body about 
a Fixed Axis 

Let us consider a rigid body that can rotate about a fixed vertical 
axis (Fig. 5.3). We shall confine the axis in bearings to prevent its 
displacements in space. The flange FI resting on the lower bearing 
prevents motion of the axis in a vertical direction. 

A perfectly rigid body can be considered as a system of particles
(point particles) with constant distances between them. Equation
(3.118), i.e.

$$ \frac{d\mathbf{L}}{dt} = \sum \mathbf{M}_{\text{ext}} $$

holds for any system of particles, including a rigid body. In the lat- 
ter case, L is the angular momentum of the body. The right-hand side 
of Eq. (3.118) is the sum of the moments of the external forces acting 
on the body. 

Let us take point O on the axis of rotation and characterize the 
position of the particles forming the body with the aid of position 
vectors r drawn from this point (Fig. 5.3 depicts the i-th particle of 
mass $m_i$). According to Eq. (3.105), the angular momentum of the
i-th particle relative to point O is 

$$ \mathbf{L}_i = [\mathbf{r}_i, m_i \mathbf{v}_i] = m_i [\mathbf{r}_i, \mathbf{v}_i] \qquad (5.7) $$

The vectors $\mathbf{r}_i$ and $\mathbf{v}_i$ are mutually perpendicular for all the particles
of the body. Therefore, the magnitude of the vector $\mathbf{L}_i$ [Eq. (5.7)] is

$$ L_i = m_i r_i v_i = m_i r_i \omega R_i \qquad (5.8) $$

[see Eq. (1.99)]. The direction of the vector $\mathbf{L}_i$ is shown in Fig. 5.4.
It must be noted that the “length” of the vector $\mathbf{L}_i$, according to Eq.
(5.8), is proportional to the velocity of rotation of the body $\omega$. The
direction of the vector $\mathbf{L}_i$, however, is independent of $\omega$. The vector
$\mathbf{L}_i$ is in a plane passing through the axis of rotation and the particle
$m_i$ and is perpendicular to $\mathbf{r}_i$.

The projection of the vector $\mathbf{L}_i$ onto the axis of rotation (the z-axis),
as can be seen from Fig. 5.4, is [see Eq. (5.8)]

$$ L_{zi} = L_i \cos\alpha = m_i r_i \omega R_i \cos\alpha = m_i (r_i \cos\alpha) R_i \omega = m_i R_i^2 \omega \qquad (5.9) $$

It is not difficult to see that for a homogeneous* body which is symmetrical
relative to the axis of rotation (for a homogeneous body of revolution), the
directions of the total angular momentum (equal to $\sum \mathbf{L}_i$) and of
$\boldsymbol{\omega}$ along the axis of rotation are the same (Fig. 5.5). Indeed, in this
case, the body can be divided into pairs of symmetrically arranged particles of
equal mass (two pairs of particles are shown in the figure: $m_i$, $m_i'$ and
$m_k$, $m_k'$). The sum of the angular momenta of each pair (in the figure
$\mathbf{L}_i + \mathbf{L}_i'$ and $\mathbf{L}_k + \mathbf{L}_k'$) is directed along the vector
$\boldsymbol{\omega}$. Hence, the total angular momentum $\mathbf{L}$ will also
coincide in direction with $\boldsymbol{\omega}$. The magnitude of the vector $\mathbf{L}$ in this
case equals the sum of the projections of the momenta $\mathbf{L}_i$ onto the
z-axis. Taking Eq. (5.9) into account, we get the following expression
for the magnitude of the angular momentum of a body:

$$ L = \sum L_{zi} = \omega \sum m_i R_i^2 = I\omega \qquad (5.10) $$

* In mechanics, a body is defined as homogeneous when its density is the
same throughout the entire volume (see Sec. 5.4).

The quantity $I$ equal to the sum of the products of the elementary
masses and the squares of their distances from a certain axis is called
the rotational inertia or the moment of inertia of the body relative
to the given axis:

$$ I = \sum m_i R_i^2 \qquad (5.11) $$

Summation is performed over all the elementary masses $m_i$ into
which the body was mentally divided.

With a view to the fact that the vectors $\mathbf{L}$ and $\boldsymbol{\omega}$ have identical
directions, we can write Eq. (5.10) as follows:

$$ \mathbf{L} = I\boldsymbol{\omega} \qquad (5.12) $$

We remind our reader that we have obtained this relation for a homo- 
geneous body rotating about an axis of symmetry. In the general case, 
as we shall see below, Eq. (5.12) is not obeyed. 

For an asymmetrical (or non-homogeneous) body, the angular
momentum $\mathbf{L}$, generally speaking, does not coincide in direction
with the vector $\boldsymbol{\omega}$. The dashed line in Fig. 5.6 shows the part of an
asymmetrical homogeneous body that is symmetrical relative to the axis
of rotation. The total angular momentum of this part, as we have
established above, is directed along $\boldsymbol{\omega}$. The momentum $\mathbf{L}_i$ of each
particle not belonging to the symmetrical part deviates to the right
from the axis of rotation (in a plane figure). Consequently, the total
angular momentum of the entire body will also deviate to the right
(Fig. 5.7). Upon rotation of the body, the vector $\mathbf{L}$ rotates together
with it, describing a cone. During the time $dt$, the vector $\mathbf{L}$ receives
the increment $d\mathbf{L}$, which according to Eq. (3.118) equals

$$ d\mathbf{L} = \left( \sum \mathbf{M}_{\text{ext}} \right) dt \qquad (5.13) $$

If the vector $\mathbf{L}$ does not change in magnitude, then the vector $d\mathbf{L}$ is
directed beyond the drawing (Fig. 5.7). The vector $\sum \mathbf{M}_{\text{ext}}$ has the
same direction. In the example we are treating, the moments of the
external forces include (1) the moment of the force of gravity $m\mathbf{g}$
directed toward us, which we shall call negative (this force is applied to
the centre of mass of the body C), (2) the positive moments of the
forces of lateral pressure of the bearings on the axis (the forces $\mathbf{F}_1$
and $\mathbf{F}_2$), and (3) the positive moment of the force of pressure of the
bearing shoulder on the flange $\mathbf{F}_3$. We assume that friction forces are
absent, otherwise the vector $\mathbf{L}$ would not be constant in magnitude,
and $d\mathbf{L}$ would not be perpendicular to $\mathbf{L}$.

The angular momentum relative to the axis of rotation [see Eq. 
(3.108)] for any (homogeneous or non-homogeneous, symmetrical or 
asymmetrical) body is 

$$ L_z = \sum L_{zi} = \omega \sum m_i R_i^2 = I\omega \qquad (5.14) $$

[see Eqs. (5.9) and (5.11)]. It must be stressed that unlike Eq. (5.12), 
Eq. (5.14) is always correct. 
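
The difference between Eqs. (5.12) and (5.14) can be seen on a small set of point masses. The sketch below is an illustration only: the particle masses, positions, and angular velocity are assumed values, and the helper names are ours. It sums $m_i [\mathbf{r}_i, \mathbf{v}_i]$ with $\mathbf{v}_i = [\boldsymbol{\omega}, \mathbf{r}_i]$ for a symmetric pair and for the same pair with one extra off-axis particle.

```python
def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def angular_momentum(particles, omega):
    """Sum of m_i [r_i, v_i] with v_i = [omega, r_i], as in Eq. (5.7)."""
    total = [0.0, 0.0, 0.0]
    for m, r in particles:
        v = cross(omega, r)
        l_i = cross(r, v)
        for k in range(3):
            total[k] += m * l_i[k]
    return tuple(total)

def moment_of_inertia_z(particles):
    """I = sum of m_i R_i^2, with R_i the distance from the z-axis, Eq. (5.11)."""
    return sum(m * (r[0] ** 2 + r[1] ** 2) for m, r in particles)

omega = (0.0, 0.0, 2.0)   # rad/s, rotation about the z-axis (assumed value)

# A pair of equal masses placed symmetrically about the z-axis.
symmetric = [(1.0, (0.5, 0.0, 0.3)), (1.0, (-0.5, 0.0, 0.3))]
# The same pair plus one extra particle that breaks the symmetry.
asymmetric = symmetric + [(0.5, (0.4, 0.0, 0.6))]

L_sym = angular_momentum(symmetric, omega)
L_asym = angular_momentum(asymmetric, omega)

print("symmetric :", L_sym, " I*omega =", moment_of_inertia_z(symmetric) * omega[2])
print("asymmetric:", L_asym, " I*omega =", moment_of_inertia_z(asymmetric) * omega[2])
```

For the symmetric pair the transverse components of $\mathbf{L}$ cancel, while for the asymmetric set they do not; in both cases the z-component equals $I\omega$.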

Equation (3.119) states that

$$ \frac{dL_z}{dt} = \sum M_{z,\text{ext}} $$

Introducing into this expression Eq. (5.14) for $L_z$, we get

$$ I\alpha_z = \sum M_{z,\text{ext}} \qquad (5.15) $$

where $\alpha_z = \dot{\omega}$ is the projection of the angular acceleration onto the
z-axis (we are considering rotation about a fixed axis, therefore the
vector $\boldsymbol{\omega}$ can change only in magnitude). Equation (5.15) is similar
to the equation $m\mathbf{a} = \mathbf{F}$. The part of the mass is played by the moment
of inertia, that of the linear acceleration by the angular acceleration,
and, finally, the part of the resultant force is played by the
total moment of the external forces.

In the above example, the moments of all the external forces are 
perpendicular to the axis of rotation. Hence, their projections onto 
the z-axis equal zero. Accordingly, the angular velocity $\omega$ remains
constant, which is what should be expected in the absence of friction. 

We must point out that in the rotation of a homogeneous symmetrical
body, forces of lateral pressure of the bearings on the axis (the
forces $\mathbf{F}_1$ and $\mathbf{F}_2$ in Fig. 5.7) do not appear. In the absence of the force
of gravity, we could remove the bearings: the axis would retain its
position in space without them. An axis whose position in space remains
constant when bodies rotate about it in the absence of external
forces is called a free axis of a body.

It is possible to prove that for a body of any shape and with an 
arbitrary arrangement of its mass there are three mutually perpendic- 
ular axes passing through the centre of mass of the body that can be 
free axes. They are called the principal axes of inertia of the body. 



Fig. 5.8    Fig. 5.9


In a homogeneous parallelepiped (Fig. 5.8), the principal axes of
inertia are obviously the axes $O_1O_1$, $O_2O_2$, and $O_3O_3$ passing through
the centres of opposite faces.

In bodies possessing axial symmetry (for example, in a homoge- 
neous* cylinder), the axis of symmetry is one of the principal axes 
of inertia. Any two mutually perpendicular axes in a plane at right 
angles to the axis of symmetry and passing through the centre of 
mass of the body can be the other two principal axes (Fig. 5.9). Thus, 
in such a body only one of the principal axes of inertia is fixed. 

In a body with central symmetry, i.e. in a sphere whose density 
depends only on the distance from its centre, any three mutually 
perpendicular axes passing through the centre of mass are the prin- 
cipal axes of inertia. Consequently, none of the principal axes of 
inertia is fixed. 

The moments of inertia relative to the principal axes are called
the principal moments of inertia of a body. In the general case,
these moments differ: $I_1 \neq I_2 \neq I_3$. For a body with axial symmetry,
two of the principal moments of inertia are the same, while the third
one, generally speaking, differs from them: $I_1 = I_2 \neq I_3$. And, finally,
for a body with central symmetry, all three principal moments of
inertia are the same: $I_1 = I_2 = I_3$.

* It is sufficient that the density of the body in each cross section be a
function only of the distance from the axis of symmetry.

Not only a homogeneous sphere, but also, for instance, a homoge- 
neous cube has equal values of the principal moments of inertia. In 
the general case, such equality may be observed for bodies of an ab- 
solutely arbitrary shape when their mass is properly distributed. All 
such bodies are called spherical tops. Their feature is that any axis 
passing through their centre of mass has the properties of a free axis, 
and, consequently, none of the principal axes is fixed, as for a sphere. 
All spherical tops behave the same when they rotate in identical 
conditions. 

Bodies for which $I_1 = I_2 \neq I_3$ behave like homogeneous bodies
of revolution. They are called symmetrical tops. Finally, bodies for
which $I_1 \neq I_2 \neq I_3$ are called asymmetrical tops.

If a body rotates in conditions when there is no external action, 
then only rotation about the principal axes corresponding to the 
maximum and minimum values of the moment of inertia is stable. 
Rotation about an axis corresponding to an intermediate value of 
the moment will be unstable. This signifies that the forces appearing 
upon the slightest deviation of the axis of rotation from this principal 
axis act in a direction causing the magnitude of this deviation to 
grow. When the axis of rotation deviates from a stable axis, the for- 
ces produced return the body to rotation about the corresponding 
principal axis. 

We can convince ourselves that what has been said above is true 
by tossing a body having the shape of a parallelepiped (for example, 
a match box) and simultaneously bringing it into rotation*. We 
shall see that the body when falling can rotate stably about axes 
passing through the biggest or smallest faces. Attempts to toss the 
body so that it rotates about an axis passing through the faces of 
an intermediate size will be unsuccessful. 

If an external force is exerted, for instance, by the string on which 
a rotating body is suspended, then only rotation about the principal 
axis corresponding to the maximum value of the moment of inertia 
will be stable. This is why a thin rod suspended by means of a string 
fastened to its end when brought into rapid rotation will in the long 
run rotate about an axis normal to it passing through its centre 
(Fig. 5.10a). A disk suspended by means of a string fastened to its 
edge (Fig. 5.10b) behaves in a similar way.

* The action of the force of gravity in this case is not significant. It only
causes the body to fall in addition to its rotation.

Up to now, we have treated bodies with a constant distribution of
their mass. Now let us assume that a rigid body can lose for a certain
time its property of a constant arrangement of its parts, and
within this time redistribution of the body’s mass occurs that results
in the moment of inertia changing from $I_1$ to $I_2$. If such a redistribution
occurs in conditions when $\sum \mathbf{M}_{\text{ext}} = 0$, then in accordance with
the law of conservation of angular momentum the following equation
must be observed:

$$ I_1 \omega_1 = I_2 \omega_2 \qquad (5.16) $$

where $\omega_1$ is the initial, and $\omega_2$ is the final value of the angular velocity
of the body. Thus, a change in the moment of inertia leads to a corresponding
change in the angular velocity.

Fig. 5.10

This explains why a spinning figure skater (or a man on a rotating platform)
begins to rotate more slowly when he stretches his arms out, and gains speed
when he presses his arms against his body.
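
Equation (5.16) can be turned into a one-line calculation. The sketch below is a minimal illustration; the function name and the skater's moments of inertia are assumed values chosen only to show the trend, not data from the text.

```python
def final_angular_velocity(i_initial, omega_initial, i_final):
    """Solve I1 * omega1 = I2 * omega2 (Eq. 5.16) for omega2."""
    if i_final <= 0:
        raise ValueError("moment of inertia must be positive")
    return i_initial * omega_initial / i_final

# Hypothetical skater: arms stretched out (I1), then pressed to the body (I2).
omega_arms_in = final_angular_velocity(i_initial=4.0, omega_initial=2.0, i_final=1.6)
print(omega_arms_in)   # larger than the initial 2.0 rad/s because I decreased
```

Reducing the moment of inertia by a factor of 2.5 raises the angular velocity by the same factor, provided the moment of the external forces is zero throughout.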


5.4. Moment of Inertia 

From the definition of the moment of inertia [see Eq. (5.11)] 

$$ I = \sum \Delta m_i R_i^2 $$*

we can see that it is an additive quantity. This signifies that the 
moment of inertia of a body equals the sum of the moments of inertia 
of its parts. 

We introduced the concept of the moment of inertia when dealing 
with the rotation of a rigid body. It must be borne in mind, however, 
that this quantity exists irrespective of rotation. Every body, re- 
gardless of whether it is rotating or at rest, has a definite moment of 
inertia relative to any axis, just like a body has a mass regardless 
of whether it is moving or at rest. 

The distribution of the mass within a body can be characterized 
with the aid of a quantity called the density. If a body is homoge- 
neous, i.e. its properties are the same at all of its points, then the 
density is defined as the quantity 

$$ \rho = \frac{m}{V} \qquad (5.17) $$

where $m$ and $V$ are the mass and volume of the body, respectively.
Thus, the density of a homogeneous body is the mass of a unit of its
volume.

* In this section, it is expedient to use the symbol $\Delta m_i$ instead of $m_i$ for
the elementary mass of a body.

For a body with an unevenly distributed mass, Eq. (5.17) gives 
the average density. The density at a given point is determined in 
this case as follows: 


$$ \rho = \lim_{\Delta V \to 0} \frac{\Delta m}{\Delta V} = \frac{dm}{dV} \qquad (5.18) $$

In this expression, $\Delta m$ is the mass contained in the volume $\Delta V$,
which in the limit transition contracts to the point at which the
density is being determined.

The limit transition in Eq. (5.18) must not be understood in the
sense that $\Delta V$ contracts literally to a point. If such a meaning is
implied, we would get a greatly differing result for two virtually coinciding
points, one of which is at the nucleus of an atom, while the
other is at a space between nuclei (the density for the first point would
be enormous, and for the second one it would be zero). Therefore,
$\Delta V$ should be diminished until we get an infinitely small volume
from the physical viewpoint. We understand this to mean such a volume
which on the one hand is small enough for the macroscopic (i.e.
belonging to a great complex of atoms) properties within its limits
to be considered identical, and on the other hand is sufficiently great
to prevent discreteness (discontinuity) of the substance from manifesting
itself.

By Eq. (5.18), the elementary mass $\Delta m_i$ equals the product of the
density of a body $\rho_i$ at a given point and the corresponding elementary
volume $\Delta V_i$:

$$ \Delta m_i = \rho_i \Delta V_i $$

Consequently, the moment of inertia can be written in the form

$$ I = \sum \rho_i R_i^2 \Delta V_i \qquad (5.19) $$

If the density of a body is constant, it can be put outside the sum:

$$ I = \rho \sum R_i^2 \Delta V_i \qquad (5.20) $$

Equations (5.19) and (5.20) are approximate. Their accuracy grows
with diminishing elementary volumes $\Delta V_i$ and the elementary masses
$\Delta m_i$ corresponding to them. Hence, the task of finding the moments
of inertia consists in integration:

$$ I = \int R^2 \, dm = \int \rho R^2 \, dV \qquad (5.21) $$

The integrals in Eq. (5.21) are taken over the entire volume of the
body. The quantities $\rho$ and $R$ in these integrals are position functions,
i.e., for example, functions of the Cartesian coordinates $x$, $y$, and $z$.

As an example, let us find the moment of inertia of a homogeneous
disk relative to an axis perpendicular to the plane of the disk and
passing through its centre (Fig. 5.11). Let us divide the disk into
annular layers of thickness $dR$. All the points of one layer
will be at the same distance $R$ from the axis.
The volume of such a layer is

$$ dV = b \, 2\pi R \, dR $$

where $b$ is the thickness of the disk.

Since the disk is homogeneous, its density at all its points is the same,
and $\rho$ in Eq. (5.21) can be put outside the integral:

$$ I = \rho \int R^2 \, dV = \rho \int_0^{R_0} R^2 \, b \, 2\pi R \, dR $$

where $R_0$ is the radius of the disk. Let us put the constant factor
$2\pi b$ outside the integral:

$$ I = 2\pi b \rho \int_0^{R_0} R^3 \, dR = 2\pi b \rho \, \frac{R_0^4}{4} $$

Finally, introducing the mass of the disk $m$ equal to the product of
the density $\rho$ and the volume of the disk $b \pi R_0^2$, we get

$$ I = \frac{m R_0^2}{2} \qquad (5.22) $$
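
The division into annular layers can be reproduced numerically. The sketch below is only a check under assumed values (the mass, radius, and number of rings are ours): it sums $R^2 \, dm$ over thin rings with the midpoint rule and compares the result with $m R_0^2 / 2$.

```python
import math

def disk_moment_of_inertia(mass, radius, n_rings=10_000):
    """Sum R^2 dm over thin rings, dm = rho * b * 2*pi*R*dR (midpoint rule)."""
    # For a homogeneous disk rho * b = mass / (pi * radius^2),
    # so the thickness b drops out of the result.
    surface_density = mass / (math.pi * radius ** 2)
    d_r = radius / n_rings
    total = 0.0
    for k in range(n_rings):
        r_mid = (k + 0.5) * d_r
        total += r_mid ** 2 * surface_density * 2.0 * math.pi * r_mid * d_r
    return total

m, r0 = 3.0, 0.2                      # kg, m (illustrative values)
numeric = disk_moment_of_inertia(m, r0)
exact = 0.5 * m * r0 ** 2             # the closed-form result for the disk
print(numeric, exact)
```

The two numbers agree to within the discretization error of the ring sum, which shrinks as the rings are made thinner.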


The finding of the moment of inertia in the above example was 
simplified quite considerably owing to the fact that the body was 
homogeneous and symmetrical, and we sought the moment of inertia 
relative to an axis of symmetry. If we wanted to find the moment of 
inertia of the disk relative, for example, to the axis O'O' perpendicu- 
lar to the disk and passing through its edge (see Fig. 5.11), the calcula- 
tions would evidently be much more complicated. The finding of the 
moment of inertia is considerably simplified in such cases if we use 
the Steiner or parallel axis theorem, which is formulated as follows: 
the moment of inertia $I$ relative to an arbitrary axis equals the moment
of inertia $I_C$ relative to an axis parallel to the given one and passing
through the body's centre of mass plus the product of the body's mass $m$
and the square of the distance $b$ between the axes:

$$ I = I_C + m b^2 \qquad (5.23) $$

According to the parallel axis theorem, the moment of inertia of
the disk relative to the axis O'O' equals the moment of inertia relative
to the axis passing through the centre of the disk, which we have
found [Eq. (5.22)], plus $m R_0^2$ (the distance between the axes O'O' and
OO equals the radius of the disk $R_0$):

$$ I = \frac{m R_0^2}{2} + m R_0^2 = \frac{3}{2} m R_0^2 $$

Thus, the parallel axis theorem in essence reduces the calculation 
of the moment of inertia relative to an arbitrary axis to the calcula- 
tion of the moment of inertia rela- 
tive to an axis passing through the 
centre of mass of the body. 
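
The parallel axis theorem, $I = I_C + m b^2$, can be checked directly on a set of point masses. The sketch below is an illustration under assumed data: the particles (given as mass and coordinates in the plane perpendicular to the axes), the position of axis O, and the function names are hypothetical.

```python
import math

def moment_about_axis(particles, axis_xy):
    """I = sum of dm_i * R_i^2 for an axis parallel to z through the point axis_xy."""
    ax, ay = axis_xy
    return sum(m * ((x - ax) ** 2 + (y - ay) ** 2) for m, x, y in particles)

def centre_of_mass(particles):
    """Total mass and the (x, y) coordinates of the centre of mass."""
    total = sum(m for m, _, _ in particles)
    cx = sum(m * x for m, x, _ in particles) / total
    cy = sum(m * y for m, _, y in particles) / total
    return total, (cx, cy)

# (mass, x, y) of each elementary mass; illustrative values.
particles = [(1.0, 0.0, 0.0), (2.0, 1.0, 0.5), (0.5, -0.3, 1.2), (1.5, 0.7, -0.4)]

m, c = centre_of_mass(particles)
axis_o = (2.0, -1.0)                      # arbitrary axis O parallel to axis C
b = math.dist(c, axis_o)                  # distance between the two axes

i_c = moment_about_axis(particles, c)     # moment about the axis through C
i_o = moment_about_axis(particles, axis_o)
print(i_o, i_c + m * b ** 2)
```

The directly summed moment about axis O coincides with $I_C + m b^2$ for any choice of the axis position.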

To prove the parallel axis theorem, let us consider axis C passing
through the centre of mass of a body and axis O parallel to it and at
a distance $b$ from axis C (Fig. 5.12, both axes are perpendicular to the
plane of the drawing). Let $\mathbf{R}_i$ be a vector perpendicular to axis C
and drawn from the axis to the elementary mass $\Delta m_i$, and $\mathbf{R}_i'$ be a
similar vector drawn from axis O. We shall also introduce the vector
$\mathbf{b}$ perpendicular to the axes and connecting the corresponding points
of axes O and C. For any pair of points opposite each other, this
vector has the same value (equal to the distance $b$ between the axes)
and the same direction. The following relation holds between the
vectors listed above:

$$ \mathbf{R}_i' = \mathbf{b} + \mathbf{R}_i $$

The square of the distance to the elementary mass $\Delta m_i$ from axis
C is $R_i^2 = \mathbf{R}_i^2$, and from axis O is

$$ R_i^{\prime 2} = (\mathbf{b} + \mathbf{R}_i)^2 = b^2 + 2\mathbf{b}\mathbf{R}_i + R_i^2 $$

With a view to the above expression, the moment of inertia of the
body relative to axis O can be written in the form

$$ I = \sum \Delta m_i R_i^{\prime 2} = b^2 \sum \Delta m_i + 2\mathbf{b} \sum \Delta m_i \mathbf{R}_i + \sum \Delta m_i R_i^2 \qquad (5.24) $$

(we have put the constant factors outside the sum). The last term in
this expression is the moment of inertia of the body relative to axis
C. Let us designate it $I_C$. The sum of the elementary masses gives the
mass of the body $m$. The sum $\sum \Delta m_i \mathbf{R}_i$ equals the product of the mass
of the body and the vector $\mathbf{R}$ drawn from axis C to the centre of mass
of the body. Since the centre of mass is on axis C, this vector $\mathbf{R}$ and,
consequently, the second term in Eq. (5.24) vanish. We thus arrive
at the conclusion that

$$ I = m b^2 + I_C $$

Q.E.D. [see Eq. (5.23)].

In concluding, we shall give the values of the moments of inertia
for selected bodies (the latter are assumed to be homogeneous, $m$
is the mass of the body).

1. The body is a thin long rod with a cross section of any shape.
The maximum cross-sectional dimension $b$ of the rod is many times
smaller than its length $l$ ($b \ll l$). The moment of inertia relative to
an axis perpendicular to the rod and passing through its middle
(Fig. 5.13) is

$$ I = \frac{1}{12} m l^2 \qquad (5.25) $$

2. For a disk or cylinder with any ratio of $R$ to $l$ (Fig. 5.14), the
moment of inertia relative to an axis coinciding with the geometrical
axis of the cylinder is

$$ I = \frac{1}{2} m R^2 \qquad (5.26) $$

3. The body is a thin disk. The thickness of the disk $b$ is many times
smaller than the radius of the disk $R$ ($b \ll R$). The moment of inertia
relative to an axis coinciding with the diameter of the disk (Fig. 5.15) is

$$ I = \frac{1}{4} m R^2 \qquad (5.27) $$

4. The moment of inertia of a sphere of radius $R$ relative to an
axis passing through its centre is

$$ I = \frac{2}{5} m R^2 \qquad (5.28) $$
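
Because the moment of inertia is additive, the value for a homogeneous sphere can be assembled from thin disks perpendicular to the axis, each contributing $\frac{1}{2} r^2 \, dm$ as a disk does about its geometrical axis. The sketch below is a numerical check under assumed values (mass, radius, number of slices) and is not part of the original derivation.

```python
import math

def sphere_moment_of_inertia(mass, radius, n_slices=20_000):
    """Stack thin disks along the axis and add their moments (midpoint rule)."""
    rho = mass / (4.0 / 3.0 * math.pi * radius ** 3)   # homogeneous sphere
    dz = 2.0 * radius / n_slices
    total = 0.0
    for k in range(n_slices):
        z = -radius + (k + 0.5) * dz
        r_sq = radius ** 2 - z ** 2          # squared radius of the slice
        dm = rho * math.pi * r_sq * dz       # mass of the slice
        total += 0.5 * dm * r_sq             # disk formula applied to the slice
    return total

m, r = 5.0, 0.1                              # kg, m (illustrative values)
numeric = sphere_moment_of_inertia(m, r)
exact = 0.4 * m * r ** 2                     # (2/5) m R^2
print(numeric, exact)
```

The slice sum reproduces the factor 2/5 to within the discretization error.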


5.5. Concept of Inertia Tensor 

We established in Sec. 5.3 that for a homogeneous body rotating
about an axis of symmetry, the relation between the vectors $\mathbf{L}$ and
$\boldsymbol{\omega}$ has a very simple form [Eq. (5.12)]

$$ \mathbf{L} = I\boldsymbol{\omega} $$

or

$$ L_x = I\omega_x, \quad L_y = I\omega_y, \quad L_z = I\omega_z \qquad (5.29) $$

The explanation is that for such a body the vectors $\mathbf{L}$ and $\boldsymbol{\omega}$ are collinear.
In the general case, however, the vectors $\mathbf{L}$ and $\boldsymbol{\omega}$ make an angle
differing from zero (see Fig. 5.7), so that the relation between them
cannot be expressed by Eq. (5.12).

Let us try to find a way of relating the vectors $\mathbf{L}$ and $\boldsymbol{\omega}$ analytically
in the most general case. We shall proceed from the fact that the
magnitudes of $\mathbf{L}$ and $\boldsymbol{\omega}$ are proportional to each other. Indeed, according
to Eq. (5.8), the magnitudes of the elementary vectors $\mathbf{L}_i$ are
proportional to the magnitude of $\boldsymbol{\omega}$. Hence, the magnitude of the
sum of these vectors is also proportional to $\omega$. It is easy to understand
that such proportionality is obtained when each component of the
vector $\mathbf{L}$ depends linearly on the components of the vector $\boldsymbol{\omega}$:

$$ L_x = I_{xx}\omega_x + I_{xy}\omega_y + I_{xz}\omega_z $$
$$ L_y = I_{yx}\omega_x + I_{yy}\omega_y + I_{yz}\omega_z \qquad (5.30) $$
$$ L_z = I_{zx}\omega_x + I_{zy}\omega_y + I_{zz}\omega_z $$

Here the quantities $I_{xx}$, $I_{xy}$, etc. are proportionality constants having
the dimension of the moment of inertia [compare with Eq. (5.29)].
When $\omega$ increases a certain number of times, each of the components
$\omega_x$, $\omega_y$, $\omega_z$, and accordingly each of the components $L_x$, $L_y$, $L_z$
grows the same number of times, as, consequently, does the vector
$\mathbf{L}$ itself.

The mutual orientation of the vectors $\mathbf{L}$ and $\boldsymbol{\omega}$ is determined by the
values of the proportionality constants. Assume, for example, that
$I_{xx} = I_{yy} = I_{zz} = I$, and the remaining constants equal zero. In
this case, Eqs. (5.30) transform into Eqs. (5.29), i.e. the vectors $\mathbf{L}$ and
$\boldsymbol{\omega}$ will be collinear. Now let us assume that the vector $\boldsymbol{\omega}$ is directed
along the z-axis, and the constants $I_{xz}$, $I_{yz}$, $I_{zz}$ differ from zero. In
this case $\omega_z = \omega$, $\omega_x = \omega_y = 0$. Substitution of these values in
Eqs. (5.30) yields

$$ L_x = I_{xz}\omega \neq 0, \quad L_y = I_{yz}\omega \neq 0, \quad L_z = I_{zz}\omega \neq 0 $$

All three components of the vector $\mathbf{L}$ differ from zero. Hence, the
vector $\mathbf{L}$ makes a certain angle with the vector $\boldsymbol{\omega}$ directed along the
z-axis.

It follows from the above that in the most general case the relation
between the angular momentum and the angular velocity of a
body can be expressed with the aid of Eqs. (5.30). Similar equations
can be written for any vectors $\mathbf{a}$ and $\mathbf{b}$ whose magnitudes are proportional
to each other:

$$ b_x = T_{xx}a_x + T_{xy}a_y + T_{xz}a_z $$
$$ b_y = T_{yx}a_x + T_{yy}a_y + T_{yz}a_z \qquad (5.31) $$
$$ b_z = T_{zx}a_x + T_{zy}a_y + T_{zz}a_z $$

These three equations can be written compactly in the form of a single
expression:

$$ b_i = \sum_{k = x, y, z} T_{ik} a_k \quad (i = x, y, z) \qquad (5.32) $$

Assuming that $i = x$ and performing summation with the subscript
$k$ sequentially having the values $x$, $y$, $z$, we get the first of the equations
(5.31), assuming that $i = y$, we get the second equation, etc.

The combination of the nine quantities $T_{xx}$, $T_{xy}$, ..., $T_{zz}$ is
called a tensor of rank two*, and the operation expressed by Eqs.
(5.31) is called multiplication of the vector $\mathbf{a}$ by the tensor $T$. Such
multiplication produces a new vector $\mathbf{b}$.
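
The summation over the subscript $k$ in the multiplication of a vector by a tensor maps directly onto a short routine. The sketch below is an illustration only: the tensor components and the vector are arbitrary assumed numbers, and the function name is ours.

```python
def tensor_times_vector(t, a):
    """b_i = sum over k of T_ik * a_k; t is a 3x3 nested list, a has 3 components."""
    return [sum(t[i][k] * a[k] for k in range(3)) for i in range(3)]

# Arbitrary symmetric tensor and a vector directed along the z-axis.
T = [[2.0, 0.0, 1.0],
     [0.0, 3.0, 0.0],
     [1.0, 0.0, 4.0]]
a = [0.0, 0.0, 1.0]

b = tensor_times_vector(T, a)
print(b)   # [1.0, 0.0, 4.0]: b is not collinear with a
```

With $\mathbf{a}$ along the z-axis, the result is simply the third column of the table of components, so a non-zero $T_{xz}$ tilts $\mathbf{b}$ away from $\mathbf{a}$.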

It is customary practice to write a tensor in the form of a square
table:

$$ T = \begin{pmatrix} T_{xx} & T_{xy} & T_{xz} \\ T_{yx} & T_{yy} & T_{yz} \\ T_{zx} & T_{zy} & T_{zz} \end{pmatrix} \qquad (5.33) $$

(we can write the subscripts 1, 2, 3 instead of $x$, $y$, $z$). The quantities
$T_{xx}$, $T_{xy}$, ... are defined as the components of the tensor. The components
$T_{xx}$, $T_{yy}$, $T_{zz}$ along the diagonal of table (5.33) are called
diagonal ones. The values of the components depend on the choice of
the coordinate axes onto which the vectors $\mathbf{a}$ and $\mathbf{b}$ are projected (the
components of these vectors also depend on the choice of the axes).

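A comparison of Eqs. (5.30) and (5.31) shows that the constants in
Eqs. (5.30) are the components of a tensor of rank two:

$$ I = \begin{pmatrix} I_{xx} & I_{xy} & I_{xz} \\ I_{yx} & I_{yy} & I_{yz} \\ I_{zx} & I_{zy} & I_{zz} \end{pmatrix} \qquad (5.34) $$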


It is called the inertia tensor of a body. This tensor characterizes 
the inertia properties of a body in rotation. 

To find the values of the components of the inertia tensor, we shall
proceed from the definition of the angular momentum of a body:

$$ \mathbf{L} = \sum m_i [\mathbf{r}_i, \mathbf{v}_i] \qquad (5.35) $$

[see Eq. (5.7)]. We shall plot the vectors $\mathbf{r}_i$ from the centre of mass of
a body (Fig. 5.16). Let us substitute the vector product $[\boldsymbol{\omega}, \mathbf{r}_i]$ for
the velocity $\mathbf{v}_i$ in Eq. (5.35) [see Eq. (1.100)]. We get

$$ \mathbf{L} = \sum m_i [\mathbf{r}_i, [\boldsymbol{\omega}, \mathbf{r}_i]] $$

* A tensor of rank two is defined as a combination of the nine quantities
$T_{xx}$, $T_{xy}$, ..., $T_{zz}$ that transform according to definite rules upon rotations of
the coordinate axes.

We shall now use Eq. (1.35):

$$ \mathbf{L} = \sum m_i \{ \boldsymbol{\omega} (\mathbf{r}_i \mathbf{r}_i) - \mathbf{r}_i (\mathbf{r}_i \boldsymbol{\omega}) \} \qquad (5.36) $$

We remind our reader that summation is conducted over all the elementary
masses into which we have mentally divided the body.

Let us associate a Cartesian system of coordinates with the body*
(see Fig. 5.16) and write the scalar products figuring in Eq. (5.36)
through the components of the vectors $\boldsymbol{\omega}$ and $\mathbf{r}_i$ along the axes of this
system [see Eq. (1.23)]. We place the origin of coordinates at the
centre of mass of the body C (it must be remembered that we plotted
the vectors $\mathbf{r}_i$ from this point). Taking into account that
$r_{xi} = x_i$, $r_{yi} = y_i$, $r_{zi} = z_i$, we get

$$ \mathbf{L} = \sum m_i \{ \boldsymbol{\omega} (x_i^2 + y_i^2 + z_i^2) - \mathbf{r}_i (x_i\omega_x + y_i\omega_y + z_i\omega_z) \} \qquad (5.37) $$

Let us find the projection of this vector onto the x-axis:

$$ L_x = \sum m_i \{ \omega_x (x_i^2 + y_i^2 + z_i^2) - x_i (x_i\omega_x + y_i\omega_y + z_i\omega_z) \} = \omega_x \sum m_i (y_i^2 + z_i^2) - \omega_y \sum m_i x_i y_i - \omega_z \sum m_i x_i z_i \qquad (5.38) $$

In a similar way, we find the projections of the vector $\mathbf{L}$ onto the axes
$y$ and $z$:

$$ L_y = -\omega_x \sum m_i y_i x_i + \omega_y \sum m_i (x_i^2 + z_i^2) - \omega_z \sum m_i y_i z_i \qquad (5.39) $$

$$ L_z = -\omega_x \sum m_i z_i x_i - \omega_y \sum m_i z_i y_i + \omega_z \sum m_i (x_i^2 + y_i^2) \qquad (5.40) $$

A comparison of the expressions obtained with Eqs. (5.30) allows
us to find the values of the components of the inertia tensor. Let us
write these values at once in the form of a table:

$$ I = \begin{pmatrix} \sum m_i (y_i^2 + z_i^2) & -\sum m_i x_i y_i & -\sum m_i x_i z_i \\ -\sum m_i y_i x_i & \sum m_i (x_i^2 + z_i^2) & -\sum m_i y_i z_i \\ -\sum m_i z_i x_i & -\sum m_i z_i y_i & \sum m_i (x_i^2 + y_i^2) \end{pmatrix} \qquad (5.41) $$

The diagonal components of the tensor are the moments of inertia
relative to the corresponding coordinate axes considered in the preceding
section. These components are called axial moments of inertia.
The non-diagonal components are called centrifugal moments of
inertia. It must be noted that the non-diagonal components of the
tensor (5.41) comply with the condition that $I_{xy} = I_{yx}$, $I_{xz} = I_{zx}$,
$I_{yz} = I_{zy}$. A tensor complying with such a condition is called symmetrical.

* It must be stressed that the axes of this system are rigidly associated with
the body and rotate together with it.
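
The axial and centrifugal moments described above can be assembled for a handful of point masses and compared with the direct sum $\sum m_i [\mathbf{r}_i, [\boldsymbol{\omega}, \mathbf{r}_i]]$. The sketch below is an illustration only: the particles, the angular velocity, and the function names are assumed, and for brevity the origin is not placed at the centre of mass (the algebraic identity being tested holds for any origin).

```python
def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def inertia_tensor(particles):
    """Axial moments on the diagonal, centrifugal moments off it, for (m, (x, y, z))."""
    t = [[0.0] * 3 for _ in range(3)]
    for m, r in particles:
        r_sq = sum(c * c for c in r)
        for i in range(3):
            for k in range(3):
                delta = 1.0 if i == k else 0.0
                t[i][k] += m * (r_sq * delta - r[i] * r[k])
    return t

def angular_momentum_direct(particles, omega):
    """Sum of m_i [r_i, [omega, r_i]] without using the tensor."""
    total = [0.0, 0.0, 0.0]
    for m, r in particles:
        l_i = cross(r, cross(omega, r))
        for k in range(3):
            total[k] += m * l_i[k]
    return total

particles = [(1.0, (1.0, 0.0, 0.0)),
             (2.0, (0.0, 1.0, 1.0)),
             (1.5, (-1.0, 0.5, 0.2))]
omega = (0.3, -0.2, 1.0)

I = inertia_tensor(particles)
L_tensor = [sum(I[i][k] * omega[k] for k in range(3)) for i in range(3)]
L_direct = angular_momentum_direct(particles, omega)
print(L_tensor)
print(L_direct)
```

Both routes give the same components of $\mathbf{L}$, and the computed table is symmetrical, as the text states.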

In practice, the inertia tensor components are computed with the
aid of integration. For example, the component $I_{xx}$ is determined by
the formula

$$ I_{xx} = \int \rho(x, y, z) \, (y^2 + z^2) \, dV $$

where $\rho(x, y, z)$ is the density, and $dV$ is the elementary volume.
Integration is performed over the entire volume of the body.



Fig. 5.17 


Let us find the components of the inertia tensor for a homogeneous
rectangular parallelepiped. We select the coordinate axes as shown in
Fig. 5.17. The origin of coordinates coincides with the centre of mass
of the body C. To calculate the axial moment of inertia $I_{zz}$, we divide
our parallelepiped into columns with a base area of $dx\,dy$. All the
elements of such a column have identical values of the coordinates $x$
and $y$. The volume of a column is $2c\,dx\,dy$, and its mass $dm$ is
$\rho\,2c\,dx\,dy$. Therefore, the contribution of the column to $I_{zz}$ is determined
by the expression

$$ dI_{zz,\,\text{column}} = 2\rho c \, (x^2 + y^2) \, dx \, dy $$

Integration of this expression with respect to $x$ gives the contribution
to $I_{zz}$ of the layer of length $2a$, width $2c$, and thickness $dy$ shown
in Fig. 5.17:

$$ dI_{zz,\,\text{layer}} = \int_{-a}^{+a} 2\rho c \, (x^2 + y^2) \, dx \, dy = 2\rho c \, dy \int_{-a}^{+a} x^2 \, dx + 2\rho c y^2 \, dy \int_{-a}^{+a} dx = \left( \frac{4}{3} \rho c a^3 + 4\rho c a y^2 \right) dy \qquad (5.42) $$

(the density $\rho$ does not depend on the coordinates $x$, $y$, and $z$ because
the body is homogeneous).

Finally, integrating Eq. (5.42) with respect to $y$, we get $I_{zz}$ for
the entire parallelepiped of mass $m$:

$$ I_{zz} = \int_{-b}^{+b} \left( \frac{4}{3} \rho c a^3 + 4\rho c a y^2 \right) dy = \frac{4}{3} \rho c a^3 \int_{-b}^{+b} dy + 4\rho c a \int_{-b}^{+b} y^2 \, dy = \frac{8}{3} \rho c a^3 b + \frac{8}{3} \rho c a b^3 = \frac{1}{3} \rho \, (2a)(2b)(2c) \, (a^2 + b^2) = \frac{1}{3} m (a^2 + b^2) $$

Similar calculations give $I_{xx} = \frac{1}{3} m (b^2 + c^2)$, and
$I_{yy} = \frac{1}{3} m (a^2 + c^2)$.
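
The division into columns used for $I_{zz}$ can be imitated with a finite grid. The sketch below is only a numerical check under assumed dimensions and mass: it adds the column contributions $\rho \, 2c \, (x^2 + y^2) \, dx \, dy$ on a midpoint grid and compares the sum with $\frac{1}{3} m (a^2 + b^2)$.

```python
def izz_parallelepiped(mass, a, b, c, n=400):
    """Midpoint-rule sum of the column contributions rho * 2c * (x^2 + y^2) dx dy."""
    rho = mass / (8.0 * a * b * c)       # homogeneous body with edges 2a, 2b, 2c
    dx, dy = 2.0 * a / n, 2.0 * b / n
    total = 0.0
    for i in range(n):
        x = -a + (i + 0.5) * dx
        for j in range(n):
            y = -b + (j + 0.5) * dy
            total += rho * 2.0 * c * (x * x + y * y) * dx * dy
    return total

m, a, b, c = 6.0, 0.3, 0.2, 0.1          # kg and half-edges in m (illustrative)
numeric = izz_parallelepiped(m, a, b, c)
exact = m * (a * a + b * b) / 3.0
print(numeric, exact)
```

The half-edge $c$ cancels between the density and the column height, which is why $I_{zz}$ depends only on $a$ and $b$.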

Now let us calculate one of the centrifugal moments, for instance
$I_{xy}$. The contribution to this moment of a column with the base
$dx\,dy$ is

$$ dI_{xy,\,\text{column}} = -\rho x y \, 2c \, dx \, dy $$

and the contribution of a layer is

$$ dI_{xy,\,\text{layer}} = -2\rho c y \, dy \int_{-a}^{+a} x \, dx = 0 $$

Accordingly, the moment of the entire parallelepiped equals zero.
A similar result is also obtained for the other centrifugal moments.
Thus, when we choose the coordinate axes as shown in Fig. 5.17,
the inertia tensor of a homogeneous rectangular parallelepiped
has the form

$$ I = \begin{pmatrix} I_x & 0 & 0 \\ 0 & I_y & 0 \\ 0 & 0 & I_z \end{pmatrix} \qquad (5.43) $$

(we have retained only one of the two identical subscripts for the
diagonal components).

We obtained such a result because we chose the principal axes of
inertia (see Sec. 5.3) of the parallelepiped as the coordinate axes.
Upon a different choice of the coordinate axes, the centrifugal moments
of inertia will differ from zero. The following reasoning will
convince us that this is true. When we choose the axes as shown in
Fig. 5.18a, the areas of rectangles 1, 2, 3, and 4 are the same. On two
of them, the product $xy$ is positive, and on two negative. As a result,
the integral of $xy$ taken over the entire area vanishes. When we choose
the axes as shown in Fig. 5.18b, the areas of the hatched figures 1
and 3 are less than those of the unhatched figures 2 and 4 (because
$a > b$). Therefore, the integral of $xy$ taken over the entire area will
differ from zero. Accordingly, the centrifugal moment $I_{xy}$ also differs
from zero.
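
The reasoning about the areas in Fig. 5.18 can be supported by a direct sum of $-x y \, dm$ over a rectangle. The sketch below is an illustration under assumed values: the rectangle has sides $2a$ and $2b$, the angle between its sides and the coordinate axes is a parameter we introduce, and the function name is hypothetical.

```python
import math

def centrifugal_moment_xy(mass, a, b, angle, n=400):
    """I_xy = -sum(dm * x * y) for a rectangle 2a x 2b whose sides are turned by `angle`."""
    dm = mass / (n * n)                        # equal mass in every grid cell
    cos_t, sin_t = math.cos(angle), math.sin(angle)
    total = 0.0
    for i in range(n):
        u = -a + (i + 0.5) * 2.0 * a / n       # coordinate along the side 2a
        for j in range(n):
            w = -b + (j + 0.5) * 2.0 * b / n   # coordinate along the side 2b
            x = u * cos_t - w * sin_t          # coordinates in the chosen axes
            y = u * sin_t + w * cos_t
            total -= dm * x * y
    return total

m, a, b = 1.0, 0.3, 0.2                        # illustrative values, a > b
aligned = centrifugal_moment_xy(m, a, b, 0.0)
tilted = centrifugal_moment_xy(m, a, b, math.radians(30.0))
print(aligned, tilted)
```

With the axes along the sides the positive and negative contributions cancel; once the axes are turned and $a \neq b$, they no longer balance and $I_{xy}$ differs from zero.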

The result obtained is common for all bodies regardless of their
shape and mass distribution. If we take the principal axes of inertia
of a body as the coordinate axes, the inertia tensor has the form given
by Eq. (5.43). The quantities $I_x$, $I_y$, $I_z$ [but not $I_{xx}$, $I_{yy}$, $I_{zz}$ in
Eq. (5.34); upon rotation of the coordinate axes all the tensor components
change, the diagonal ones included] are called the principal
moments of inertia of a body. It must be underlined that the axial
moments calculated not about arbitrary axes, but about the principal
ones, are called the principal moments of inertia.

The principal axes of inertia are mutually perpendicular and in- 
tersect at the centre of mass of a body. In the general case (when 
I x I y =£ / z ), we can choose these axes in a single way. For a 
spherical top (i.e. a body for which I x = I y = I z , see Sec. 5.3), 
the position of the principal axes is absolutely indeterminate. For 
a symmetrical top ( I x = I y =?£= / z ), only the z-axis is fixed, the other 
two axes being indeterminate. 

Assume that a body rotates about one of its principal axes of
inertia, say about the z-axis. Selecting the principal axes as the
coordinate ones, we have $\omega_z = \omega$, $\omega_x = \omega_y = 0$.
Since the inertia tensor has the form of Eq. (5.43) when the
coordinate axes are chosen in this way, Eqs. (5.30) give the following
values of the components of the angular momentum of a body:

    $L_x = L_y = 0, \quad L_z = I_z \omega$

Consequently, the vector $\mathbf{L}$ has the same direction as
$\boldsymbol{\omega}$. The same result is obtained for rotation of a body
about the other principal axes. In all these cases, we arrive at
Eq. (5.12):

    $\mathbf{L} = I \boldsymbol{\omega}$

where I is the corresponding principal moment of inertia of the body.
In Sec. 5.3, we obtained Eq. (5.12) for a homogeneous body rotating
about its axis of symmetry. Now we have established that this
equation holds when an arbitrary body rotates about one of its
principal axes of inertia.
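A short numerical illustration of this statement follows. It is a sketch under stated assumptions: the components of the angular momentum are taken as $L_i = \sum_k I_{ik} \omega_k$ (the form Eqs. (5.30) are assumed to have), the tensor is the diagonal tensor (5.43) of a box with half-sides a, b, c, and all helper names and numbers are illustrative.

```python
def angular_momentum(inertia, omega):
    """L_i = sum over k of I_ik * omega_k for a 3x3 tensor given as nested lists."""
    return [sum(inertia[i][k] * omega[k] for k in range(3)) for i in range(3)]

def cross(u, v):
    """Vector product [u, v] of two 3-component lists."""
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

# Principal moments of a homogeneous box with half-sides a, b, c and mass m
a, b, c, m = 2.0, 1.0, 0.5, 8.0
I_x = m * (b * b + c * c) / 3
I_y = m * (a * a + c * c) / 3
I_z = m * (a * a + b * b) / 3
tensor = [[I_x, 0.0, 0.0], [0.0, I_y, 0.0], [0.0, 0.0, I_z]]

omega_principal = [0.0, 0.0, 3.0]   # rotation about the principal z-axis
omega_oblique = [1.0, 1.0, 0.0]     # rotation about an axis that is not principal

L_principal = angular_momentum(tensor, omega_principal)
L_oblique = angular_momentum(tensor, omega_oblique)

# [L, omega] vanishes only when the two vectors are collinear
print(L_principal, cross(L_principal, omega_principal))
print(L_oblique, cross(L_oblique, omega_oblique))
```

For the rotation about the z-axis the vector product of L and the angular velocity is zero, so the two vectors are collinear. For the oblique axis it is not zero, because $I_x$ and $I_y$ differ.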

In conclusion, let us determine when the equation
$\dot{\mathbf{L}} = \mathbf{M}$ [see Eq. (3.118)], which is always
correct, can be written in the form

    $I \boldsymbol{\alpha} = \mathbf{M}$    (5.44)

We may do this first of all when a body rotates about a principal
axis, and the moment of the forces $\mathbf{M}$ is directed along this
axis. Indeed, in this case, the moment $\mathbf{M}$ produces the
increment $d\mathbf{L}$ that is collinear with $\mathbf{L}$
($d\mathbf{L} = \mathbf{M} \, dt$). Hence, rotation constantly takes
place about a principal axis so that the relation
$\mathbf{L} = I \boldsymbol{\omega}$ is never violated. In this case,
however, Eq. (5.44) gives nothing new in comparison with the formula

    $I \alpha_z = M_z$    (5.45)

Here z is the axis of rotation.

When $\mathbf{M}$ is not collinear with $\mathbf{L}$ (for example, when
$\mathbf{M}$ is perpendicular to $\mathbf{L}$), the axis of rotation
moves relative to the body with time. Consequently, even provided that
the relation $\mathbf{L} = I \boldsymbol{\omega}$ is obeyed at the
initial moment, this relation stops being obeyed with time, and
Eq. (5.44) loses its meaning. The displacement of the axis of rotation
relative to the body is of no significance only when the body is a
spherical top.

For such a top, any axis is a principal one and has the same value of
the moment of inertia I. Therefore, Eq. (5.44) holds for any mutual
direction of the vectors $\mathbf{M}$ and $\boldsymbol{\omega}$.

5.6. Kinetic Energy of 
a Rotating Body 

Let us begin with a consideration 
of the rotation of a body about a fixed 
axis, which we shall call the z-axis 
(Fig. 5.19). The linear velocity of the
elementary mass $m_i$ is $v_i = \omega R_i$, where $R_i$ is the distance
from the mass $m_i$ to the z-axis. Consequently, we get the following
expression for the kinetic energy of the i-th elementary mass:

    $E_{k,i} = \tfrac{1}{2} m_i v_i^2 = \tfrac{1}{2} m_i R_i^2 \omega^2$

The kinetic energy of a body is composed of the kinetic energies of
its parts:

    $E_k = \sum E_{k,i} = \tfrac{1}{2} \omega^2 \sum m_i R_i^2$

The sum in the right-hand side of this equation is the moment of
inertia of the body $I_z$ relative to the axis of rotation. The kinetic
energy of a body rotating about a fixed axis thus equals

    $E_k = \tfrac{1}{2} I_z \omega^2$    (5.46)

Assume that the mass $m_i$ experiences* the internal force
$\mathbf{f}_i$ and the external force $\mathbf{F}_i$ (see Fig. 5.19).
According to Eq. (3.16), these forces do the following work during the
time dt:

    $dA_i = \mathbf{f}_i \mathbf{v}_i \, dt + \mathbf{F}_i \mathbf{v}_i \, dt = \mathbf{f}_i [\boldsymbol{\omega}, \mathbf{r}_i] \, dt + \mathbf{F}_i [\boldsymbol{\omega}, \mathbf{r}_i] \, dt$

Performing a cyclic transposition of the multipliers in the scalar
triple products [see Eq. (1.34)], we get

    $dA_i = \boldsymbol{\omega} [\mathbf{r}_i, \mathbf{f}_i] \, dt + \boldsymbol{\omega} [\mathbf{r}_i, \mathbf{F}_i] \, dt = \boldsymbol{\omega} \mathbf{M}_{\text{int},i} \, dt + \boldsymbol{\omega} \mathbf{M}_i \, dt$    (5.47)

where $\mathbf{M}_{\text{int},i}$ is the moment of an internal force
relative to point O, and $\mathbf{M}_i$ is the similar moment of an
external force.

Summation of Eq. (5.47) for all the elementary masses yields
the elementary work done on the body during the time dt:

    $dA = \sum dA_i = \boldsymbol{\omega} \left( \sum \mathbf{M}_{\text{int},i} \right) dt + \boldsymbol{\omega} \left( \sum \mathbf{M}_i \right) dt$

The sum of the moments of the internal forces equals zero [see
Eq. (3.117)]. Consequently, designating the total moment of the
external forces by $\mathbf{M}$, we get the expression

    $dA = \boldsymbol{\omega} \mathbf{M} \, dt = \omega M_z \, dt$    (5.48)

[we have used Eq. (1.21), taking into account that $M_\omega = M_z$].
Finally, since $\omega \, dt$ is the angle $d\varphi$ through which the
body turns during the time dt, we have

    $dA = M_z \, d\varphi$    (5.49)

The sign of the work depends on that of $M_z$, i.e. on the sign of the
projection of the vector $\mathbf{M}$ onto the direction of the vector
$\boldsymbol{\omega}$.

Thus, internal forces do no work when a body rotates, and the work
of the external forces is determined by Eq. (5.49).

We can also arrive at Eq. (5.49) by taking advantage of the fact that
the work done by all the forces applied to a body goes to increase
its kinetic energy [see Eq. (3.11)]. Differentiating both sides of
Eq. (5.46), we obtain

    $dE_k = I_z \omega \, d\omega = I_z \omega \dot{\omega} \, dt$

According to Eq. (5.15), $I_z \dot{\omega} = M_z$, and the product
$\omega \, dt$ equals $d\varphi$. Hence, substituting dA for $dE_k$, we
arrive at Eq. (5.49).

* The resultant force $\mathbf{f}_i + \mathbf{F}_i$ is in a plane
perpendicular to the axis of rotation.
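The equality of the work (5.49) and the increase in the kinetic energy (5.46) can be illustrated by a simple step-by-step calculation. This is only a sketch: it assumes a constant moment $M_z$ acting on a body that starts from rest, and the values of the moment of inertia, the moment of the force, and the time step are arbitrary illustrative numbers.

```python
I_z = 0.4      # moment of inertia about the fixed axis, kg*m^2 (illustrative)
M_z = 1.2      # constant moment of the external forces, N*m (illustrative)
dt = 1e-4      # time step, s
steps = 20000  # 2 s of motion in total

omega = 0.0    # angular velocity, the body starts from rest
phi = 0.0      # angle turned through
work = 0.0     # accumulated work, the sum of dA = M_z * d_phi

for _ in range(steps):
    alpha = M_z / I_z                            # I_z * alpha = M_z
    d_phi = omega * dt + 0.5 * alpha * dt * dt   # exact for constant alpha
    work += M_z * d_phi
    phi += d_phi
    omega += alpha * dt

kinetic_energy = 0.5 * I_z * omega ** 2          # Eq. (5.46)
print("work done by the moment:", work)
print("kinetic energy gained:  ", kinetic_energy)
```

Both printed values agree to rounding accuracy, and both equal $M_z \varphi$ for the total angle turned through.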

Table 5.1 compares the formulas of mechanics of rotation with 
similar formulas of mechanics of translation (mechanics of a par- 
ticle). This comparison shows that in all cases of rotation the part 
of mass is played by the moment of inertia, the part of force by 
the moment of a force, the part of momentum by the angular momen- 
tum, and so on. 

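Table 5.1

Translation                                   Rotation

$\mathbf{v}$ = linear velocity                $\boldsymbol{\omega}$ = angular velocity
$\mathbf{a} = \dot{\mathbf{v}}$ = linear acceleration      $\boldsymbol{\alpha} = \dot{\boldsymbol{\omega}}$ = angular acceleration
m = mass                                      $I_z$ = moment of inertia
$\mathbf{p} = m\mathbf{v}$ = momentum         $L_z = I_z \omega$ = angular momentum
$\mathbf{F}$ = force                          $\mathbf{M}$ or $M_z$ = moment of force
$\dot{\mathbf{p}} = \mathbf{F}$               $\dot{\mathbf{L}} = \mathbf{M}$
$m\mathbf{a} = \mathbf{F}$                    $I \alpha_z = M_z$
$E_k = \tfrac{1}{2} m v^2$                    $E_k = \tfrac{1}{2} I \omega^2$ (for a fixed axis of rotation)
$dA = F_s \, ds$                              $dA = M_z \, d\varphi$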


We obtained Eq. (5.46) for the case when a body rotates about 
a stationary axis fixed in the body. Now let us assume that a body 
rotates arbitrarily relative to a fixed point coinciding with its 
centre of mass. We shall rigidly associate a Cartesian system of 
coordinates with the body and place its origin at the centre of mass. 
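The velocity of the i-th elementary mass is
$\mathbf{v}_i = [\boldsymbol{\omega}, \mathbf{r}_i]$. Consequently,
we can write the following expression for the kinetic energy of the
body:

    $E_k = \tfrac{1}{2} \sum m_i v_i^2 = \tfrac{1}{2} \sum m_i [\boldsymbol{\omega}, \mathbf{r}_i]^2 = \tfrac{1}{2} \sum m_i \omega^2 r_i^2 \sin^2 \varphi_i$

where $\varphi_i$ is the angle between the vectors $\boldsymbol{\omega}$
and $\mathbf{r}_i$. Substituting $1 - \cos^2 \varphi_i$ for
$\sin^2 \varphi_i$ and taking into account that
$\omega r_i \cos \varphi_i = \boldsymbol{\omega} \mathbf{r}_i$, we have

    $E_k = \tfrac{1}{2} \sum m_i \{ \omega^2 r_i^2 - (\boldsymbol{\omega} \mathbf{r}_i)^2 \}$

Let us write out the scalar products through the projections of the
vectors $\boldsymbol{\omega}$ and $\mathbf{r}_i$ onto the axes of the
coordinate system associated with the body:

    $E_k = \tfrac{1}{2} \sum m_i \{ (\omega_x^2 + \omega_y^2 + \omega_z^2)(x_i^2 + y_i^2 + z_i^2) -$
    $- (\omega_x x_i + \omega_y y_i + \omega_z z_i)(\omega_x x_i + \omega_y y_i + \omega_z z_i) \} =$
    $= \tfrac{1}{2} \sum m_i \{ (\omega_x^2 + \omega_y^2 + \omega_z^2)(x_i^2 + y_i^2 + z_i^2) -$
    $- \omega_x^2 x_i^2 - \omega_x \omega_y x_i y_i - \omega_x \omega_z x_i z_i - \omega_y \omega_x y_i x_i - \omega_y^2 y_i^2 -$
    $- \omega_y \omega_z y_i z_i - \omega_z \omega_x z_i x_i - \omega_z \omega_y z_i y_i - \omega_z^2 z_i^2 \}$

Finally, combining addends with identical products of the angular
velocity components and putting these products outside the sums,
we get

    $E_k = \tfrac{1}{2} \{ \omega_x^2 \sum m_i (y_i^2 + z_i^2) + \omega_y^2 \sum m_i (x_i^2 + z_i^2) +$
    $+ \omega_z^2 \sum m_i (x_i^2 + y_i^2) - \omega_x \omega_y \sum m_i x_i y_i - \omega_x \omega_z \sum m_i x_i z_i -$
    $- \omega_y \omega_x \sum m_i y_i x_i - \omega_y \omega_z \sum m_i y_i z_i - \omega_z \omega_x \sum m_i z_i x_i - \omega_z \omega_y \sum m_i z_i y_i \}$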

The sums by which the products of the angular velocity components
are multiplied are the components of the inertia tensor [see
Eq. (5.41)]. Hence, we have arrived at the equation

    $E_k = \tfrac{1}{2} \{ I_{xx} \omega_x^2 + I_{xy} \omega_x \omega_y + I_{xz} \omega_x \omega_z + I_{yx} \omega_y \omega_x +$
    $+ I_{yy} \omega_y^2 + I_{yz} \omega_y \omega_z + I_{zx} \omega_z \omega_x + I_{zy} \omega_z \omega_y + I_{zz} \omega_z^2 \}$    (5.50)

This equation can be written in the form

    $E_k = \tfrac{1}{2} \sum_{i,k = x,y,z} I_{ik} \omega_i \omega_k$    (5.51)

In summation, the subscripts i and k are sequentially given the
values x, y, z independently of each other.

If the axes of a coordinate system associated with a body are
chosen so that they coincide with the principal axes of inertia
of the body, the centrifugal moments of inertia will vanish, and
Eq. (5.50) will become simplified as follows:

    $E_k = \tfrac{1}{2} \{ I_x \omega_x^2 + I_y \omega_y^2 + I_z \omega_z^2 \}$    (5.52)

Here $I_x$, $I_y$, $I_z$ are the principal moments of inertia of the
body. For a spherical top, these moments have the identical value I so
that Eq. (5.52) becomes $E_k = \tfrac{1}{2} I \omega^2$ [compare with
Eq. (5.46)].

When an arbitrary body rotates about one of the principal axes of
inertia, say the z-axis, we have $\omega_z = \omega$, $\omega_x = \omega_y = 0$, and Eq. (5.52)
transforms into (5.46). Thus, the kinetic energy of a rotating body 
equals half the product of the moment of inertia and the square 
of the angular velocity in three cases: (1) for a body rotating about 
a fixed axis, (2) for a body rotating about one of the principal axes 
of inertia, and (3) for a spherical top. In all other cases, the kinetic 
energy is determined by more complicated equations (5.50) or (5.52). 
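The equivalence of Eq. (5.51) and the direct sum of the kinetic energies of the elementary masses can be checked on a handful of point masses. The sketch below assumes the tensor components $I_{ik} = \sum m (r^2 \delta_{ik} - x_i x_k)$, which reproduce the axial moments and the centrifugal moments with a minus sign used in this chapter. The masses, positions and angular velocity are arbitrary illustrative values.

```python
def cross(u, v):
    """Vector product [u, v] of two 3-component sequences."""
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def inertia_tensor(masses, positions):
    """I_ik = sum of m * (r^2 * delta_ik - x_i * x_k) over the point masses."""
    tensor = [[0.0] * 3 for _ in range(3)]
    for m, r in zip(masses, positions):
        r2 = sum(x * x for x in r)
        for i in range(3):
            for k in range(3):
                tensor[i][k] += m * ((r2 if i == k else 0.0) - r[i] * r[k])
    return tensor

def energy_from_tensor(tensor, omega):
    """E_k = (1/2) * sum over i, k of I_ik * omega_i * omega_k, Eq. (5.51)."""
    return 0.5 * sum(tensor[i][k] * omega[i] * omega[k]
                     for i in range(3) for k in range(3))

def energy_direct(masses, positions, omega):
    """E_k = (1/2) * sum of m * v^2 with v = [omega, r]."""
    total = 0.0
    for m, r in zip(masses, positions):
        v = cross(omega, r)
        total += 0.5 * m * sum(c * c for c in v)
    return total

masses = [1.0, 2.0, 3.0]
positions = [(1.0, 0.0, 0.0), (0.0, 2.0, 1.0), (-1.0, 1.0, 2.0)]
omega = (0.5, -1.0, 2.0)

tensor = inertia_tensor(masses, positions)
e_tensor = energy_from_tensor(tensor, omega)
e_direct = energy_direct(masses, positions, omega)
print(e_tensor, e_direct)
```

The two numbers coincide to rounding accuracy, and the computed tensor is symmetric, as the pairs of equal mixed terms in Eq. (5.50) require.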


5.7. Kinetic Energy of a Body 
in Plane Motion 

The plane motion of a body can be represented as the superposition
of two motions: translation with a velocity $\mathbf{v}_0$ and rotation
about the relevant axis with the angular velocity $\boldsymbol{\omega}$
(see Sec. 5.1). By Eq. (5.1), the velocity of the i-th elementary mass
of a body is

    $\mathbf{v}_i = \mathbf{v}_0 + [\boldsymbol{\omega}, \mathbf{r}_i]$

where $\mathbf{v}_0$ is the velocity of a certain point O of the body,
and $\mathbf{r}_i$ is the position vector determining the position of
the elementary mass with respect to point O.

The kinetic energy of the i-th elementary mass is

    $E_{k,i} = \tfrac{1}{2} m_i v_i^2 = \tfrac{1}{2} m_i \{ \mathbf{v}_0 + [\boldsymbol{\omega}, \mathbf{r}_i] \}^2$

Squaring the expression in braces, we get

    $E_{k,i} = \tfrac{1}{2} m_i \{ v_0^2 + 2 \mathbf{v}_0 [\boldsymbol{\omega}, \mathbf{r}_i] + [\boldsymbol{\omega}, \mathbf{r}_i]^2 \}$

The vector product of $\boldsymbol{\omega}$ and $\mathbf{r}_i$ has a
magnitude equal to $\omega R_i$, where $R_i$ is the distance to the mass
$m_i$ from the axis of rotation [see Fig. 1.33 and the text preceding
Eq. (1.100)]. Consequently, the third addend in the braces equals
$\omega^2 R_i^2$. Let us perform a cyclic transposition of the
multipliers in the second addend [see Eq. (1.34)]. As a result, we
obtain

    $E_{k,i} = \tfrac{1}{2} m_i \{ v_0^2 + 2 [\mathbf{v}_0, \boldsymbol{\omega}] \mathbf{r}_i + \omega^2 R_i^2 \}$    (5.53)

To obtain the kinetic energy of a body, we find the sum of Eq. (5.53)
for all the elementary masses, putting the constant factors outside
the sum:

    $E_k = \tfrac{1}{2} v_0^2 \sum m_i + [\mathbf{v}_0, \boldsymbol{\omega}] \sum m_i \mathbf{r}_i + \tfrac{1}{2} \omega^2 \sum m_i R_i^2$

The sum of the elementary masses $\sum m_i$ is the mass of the body m.
The expression $\sum m_i \mathbf{r}_i$ is the product of the mass of the
body and the position vector $\mathbf{r}_C$ of the centre of mass of the
body. Finally, $\sum m_i R_i^2$ is the moment of inertia of the body
$I_O$ relative to an axis passing through point O. We can therefore
write that

    $E_k = \tfrac{1}{2} m v_0^2 + m \mathbf{r}_C [\mathbf{v}_0, \boldsymbol{\omega}] + \tfrac{1}{2} I_O \omega^2$    (5.54)

If we take the centre of mass of the body as point O, the position
vector $\mathbf{r}_C$ will equal zero, and the second addend will
vanish. Consequently, designating by $\mathbf{v}_C$ the velocity of the
centre of mass, and by $I_C$ the moment of inertia of the body relative
to an axis passing through point C, we get the following expression
for the kinetic energy of the body:

    $E_k = \tfrac{1}{2} m v_C^2 + \tfrac{1}{2} I_C \omega^2$    (5.55)

Thus, the kinetic energy of the body in plane motion consists of 
the energy of translation with a velocity equalling that of the centre 
of mass and the energy of rotation about an axis passing through 
the centre of mass of the body. 
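Equations (5.54) and (5.55) can be compared with the direct sum of the kinetic energies of the elementary masses. The sketch below assumes a flat set of point masses in the plane of motion, with the angular velocity directed along the z-axis and the positions measured from an arbitrary point O of the body. The function names and numerical values are illustrative only.

```python
def energy_direct(masses, points, v0, omega):
    """Sum of m * v^2 / 2 with v = v0 + [omega, r], omega along z."""
    total = 0.0
    for m, (x, y) in zip(masses, points):
        vx = v0[0] - omega * y
        vy = v0[1] + omega * x
        total += 0.5 * m * (vx * vx + vy * vy)
    return total

def centre_of_mass(masses, points):
    m = sum(masses)
    xc = sum(mi * x for mi, (x, y) in zip(masses, points)) / m
    yc = sum(mi * y for mi, (x, y) in zip(masses, points)) / m
    return m, xc, yc

def energy_eq_5_54(masses, points, v0, omega):
    """(1/2) m v0^2 + m r_C [v0, omega] + (1/2) I_O omega^2 about point O."""
    m, xc, yc = centre_of_mass(masses, points)
    i_o = sum(mi * (x * x + y * y) for mi, (x, y) in zip(masses, points))
    # [v0, omega] = (v0_y * omega, -v0_x * omega, 0) for omega along z
    mixed = m * (xc * v0[1] * omega - yc * v0[0] * omega)
    return 0.5 * m * (v0[0] ** 2 + v0[1] ** 2) + mixed + 0.5 * i_o * omega ** 2

def energy_eq_5_55(masses, points, v0, omega):
    """(1/2) m v_C^2 + (1/2) I_C omega^2 about the centre of mass."""
    m, xc, yc = centre_of_mass(masses, points)
    vcx = v0[0] - omega * yc
    vcy = v0[1] + omega * xc
    i_c = sum(mi * ((x - xc) ** 2 + (y - yc) ** 2)
              for mi, (x, y) in zip(masses, points))
    return 0.5 * m * (vcx ** 2 + vcy ** 2) + 0.5 * i_c * omega ** 2

masses = [1.0, 2.0, 0.5, 1.5]
points = [(0.0, 0.0), (1.0, 0.5), (-0.5, 2.0), (2.0, -1.0)]
v0, omega = (0.3, -1.2), 2.5

e_direct = energy_direct(masses, points, v0, omega)
e_54 = energy_eq_5_54(masses, points, v0, omega)
e_55 = energy_eq_5_55(masses, points, v0, omega)
print(e_direct, e_54, e_55)
```

All three values agree. The mixed term of Eq. (5.54) is needed because point O in this example is not the centre of mass.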

5.8. Application of the Laws 
of Dynamics of a Body 

The motion of a rigid body is described by two equations that 
have already been given in previous sections: 

    $m \mathbf{a}_C = \sum \mathbf{F}_{\text{ext}}$    (5.6)

    $\dot{\mathbf{L}} = \sum \mathbf{M}_{\text{ext}}$    (3.118)

The motion of a body is thus determined by the external forces and 
the moments of these forces acting on it. 

The moments of the forces may be taken relative to any point that 
is stationary or moving without acceleration. If we took the moment 
of the external forces relative to a point moving with acceleration, 
we would in essence write Eq. (3.118) in a non-inertial reference 
frame. In this case, we must take into consideration the forces of 
inertia and their moments apart from the external forces due to the 
interaction of the given body with other bodies. 

The points of application of the forces acting on a body may be
transferred along the lines of action of the forces because neither
the sum of the forces nor their moments will change when this is
done (when a force is transferred along the line of its action, the
moment arm relative to any point remains unchanged). This permits
us to replace several forces with a single one equivalent to
them in its action on a body. For example, the two forces
$\mathbf{F}_1$ and $\mathbf{F}_2$ in one plane (Fig. 5.20) may be
replaced with the force $\mathbf{F}$ equivalent to them. The point of
application of the latter may also be chosen arbitrarily on its line
of action.

A combination of parallel forces acting on a body may be replaced
with their resultant equal to the sum of all the forces and applied
to a point of the body such that its moment equals the sum of the
moments of the separate forces.

Fig. 5.20

Let us find the resultant of the forces of gravity. These forces are
applied to all the elements of a body, the force $m_i \mathbf{g}$
acting on the elementary mass $m_i$.

The sum of these forces is $\mathbf{P} = m\mathbf{g}$, where
$m = \sum m_i$ is the mass of the body. The total moment of the forces
of gravity relative to a certain point O is

    $\mathbf{M} = \sum [\mathbf{r}_i, (m_i \mathbf{g})]$

where $\mathbf{r}_i$ is the position vector determining the position of
the mass $m_i$ with respect to point O. Transferring the scalar
multiplier $m_i$ from the second member of the product to the first one
and then putting the common factor $\mathbf{g}$ outside the sum, we get

    $\mathbf{M} = \left[ \left( \sum m_i \mathbf{r}_i \right), \mathbf{g} \right]$

The sum in parentheses equals the product of the mass of the body
and the position vector $\mathbf{r}_C$ of the centre of mass C. Hence,

    $\mathbf{M} = [(m \mathbf{r}_C), \mathbf{g}] = [\mathbf{r}_C, (m \mathbf{g})] = [\mathbf{r}_C, \mathbf{P}]$    (5.56)

Thus, the total moment of the forces of gravity relative to an
arbitrary point O coincides with the moment of the force
$m\mathbf{g}$ applied to point C. Hence, the resultant of the forces of
gravity equals $\mathbf{P} = m\mathbf{g}$ and is applied to the centre
of mass of the body. We must note that this holds only when the field
of the forces of gravity is homogeneous within the body [in deriving
Eq. (5.56) we considered that $\mathbf{g} = \text{const}$].

It follows from Eq. (5.56) that the moment of the forces of gravity
relative to the centre of mass equals zero (in this case
$\mathbf{r}_C = 0$). The point relative to which the moment of the
forces of gravity equals zero is called the centre of gravity of the
body. Thus, when the field of gravity forces is homogeneous within a
body, the centre of gravity coincides with the centre of mass.
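Equation (5.56) can be verified on a small set of point masses in a homogeneous field. This is only a sketch: the masses and positions are arbitrary illustrative numbers, the positions are measured from an arbitrary point O, and g is taken along the negative z-axis.

```python
def cross(u, v):
    """Vector product [u, v] of two 3-component sequences."""
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

g = (0.0, 0.0, -9.81)                      # homogeneous field
masses = [1.0, 2.5, 0.5]
positions = [(1.0, 0.0, 2.0), (-0.5, 1.5, 0.0), (2.0, 2.0, -1.0)]

# Total moment of the forces of gravity about point O: sum of [r_i, m_i g]
moment_sum = [0.0, 0.0, 0.0]
for m_i, r_i in zip(masses, positions):
    weight_i = [m_i * comp for comp in g]
    term = cross(r_i, weight_i)
    moment_sum = [s + t for s, t in zip(moment_sum, term)]

# Moment of the single force P = m g applied to the centre of mass C
m = sum(masses)
r_c = [sum(m_i * r_i[k] for m_i, r_i in zip(masses, positions)) / m
       for k in range(3)]
moment_resultant = cross(r_c, [m * comp for comp in g])

print(moment_sum)
print(moment_resultant)
```

The two moments coincide. The agreement relies on g being the same for every elementary mass, which is exactly the condition stated in the text.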

For a homogeneous gravitational field, the forces of gravity
applied to different elementary masses have an identical direction
and are proportional to $m_i$. The forces of inertia produced in a
non-inertial reference frame in translational motion relative to
inertial frames have the same property. Indeed, in this case, the
forces of inertia applied to the elementary masses $m_i$ equal
$-m_i \mathbf{a}_0$, where $\mathbf{a}_0$ is the acceleration of the
non-inertial frame [see Eq. (4.2)]. By repeating the reasoning that
led us to Eq. (5.56) (here $-m_i \mathbf{a}_0$ must be substituted for
$m_i \mathbf{g}$), we can show that the resultant of the inertia
forces equals $-m \mathbf{a}_0$ and is applied to the centre of mass of
the body. It must be stressed that this holds only for reference
frames in translational motion.

The moment of the inertia forces relative to the centre of mass 
equals zero (in a frame with translational motion). Therefore, when 
compiling Eq. (3.118) for the moments taken relative to the centre 
of mass, the forces of inertia do not have to be taken into consid- 
eration. 

Let us find the conditions of equilibrium of a rigid body. A body
can remain in a state of rest if nothing causes the appearance of
translation or rotation. According to Eqs. (5.6) and (3.118), two
conditions are necessary and sufficient in this case:

(1) the sum of all the external forces applied to a body must
equal zero:

    $\sum \mathbf{F}_{\text{ext}} = 0$    (5.57)

(2) the resultant moment of the external forces relative to any
point must equal zero:

    $\sum \mathbf{M}_{\text{ext}} = 0$    (5.58)

When condition (5.57) is obeyed, from the equality to zero of the
sum of the moments for one point O we get the equality to zero of
the sum of the moments relative to any other point O'. Indeed,
assume that for a certain point O we have

    $\sum \mathbf{M}_i = \sum [\mathbf{r}_i, \mathbf{F}_i] = 0$    (5.59)

Let us take another point O' whose position relative to O is
determined by the vector $\mathbf{b}$. Examination of Fig. 5.21 shows
that $\mathbf{r}'_i = \mathbf{r}_i - \mathbf{b}$. Consequently, the sum
of the moments relative to point O' is

    $\sum \mathbf{M}'_i = \sum [\mathbf{r}'_i, \mathbf{F}_i] = \sum [(\mathbf{r}_i - \mathbf{b}), \mathbf{F}_i] = \sum [\mathbf{r}_i, \mathbf{F}_i] - \sum [\mathbf{b}, \mathbf{F}_i]$

According to Eq. (5.59), the first sum equals zero. Factoring out
the constant quantity $\mathbf{b}$ in the second sum, we get the
expression $-[\mathbf{b}, \sum \mathbf{F}_i]$, which in view of
Eq. (5.57) also vanishes. Thus, from Eq. (5.57) and condition (5.59)
for point O, we get condition (5.59) for point O'.

It must be noted that the vector condition (5.58) is equivalent
to three scalar ones:

    $\sum M_{x,\text{ext}} = 0, \quad \sum M_{y,\text{ext}} = 0, \quad \sum M_{z,\text{ext}} = 0$    (5.60)

Thus, the conditions of equilibrium of a rigid body are deter- 
mined by Eqs. (5.57) and (5.58), or by Eqs. (5.57) and (5.60). 
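The independence of condition (5.58) from the choice of the point can be illustrated with three forces applied to a beam. This is a sketch with hypothetical numbers: two upward forces of 5 units at the ends of the beam, one downward force of 10 units at the middle, and an arbitrarily chosen displacement b of the second point O'.

```python
def cross(u, v):
    """Vector product [u, v] of two 3-component sequences."""
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def total_force(forces):
    return [sum(f[k] for f in forces) for k in range(3)]

def total_moment(points, forces, origin):
    """Sum of [r_i - origin, F_i] over all the applied forces."""
    moment = [0.0, 0.0, 0.0]
    for r, f in zip(points, forces):
        arm = [r[k] - origin[k] for k in range(3)]
        term = cross(arm, f)
        moment = [m + t for m, t in zip(moment, term)]
    return moment

points = [(-1.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 0.0, 0.0)]
forces = [(0.0, 5.0, 0.0), (0.0, 5.0, 0.0), (0.0, -10.0, 0.0)]

point_o = (0.0, 0.0, 0.0)
b = (3.0, -2.0, 7.0)               # position of O' relative to O

force_sum = total_force(forces)
moment_o = total_moment(points, forces, point_o)
moment_o_prime = total_moment(points, forces, b)

print(force_sum, moment_o, moment_o_prime)
```

Since the sum of the forces is zero, the moments about O and O' differ by $-[\mathbf{b}, \sum \mathbf{F}_i] = 0$, and both vanish for this set of forces.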



Fig. 5.21 





In conclusion, let us consider an example of the application of 
the laws of dynamics of a rigid body. Assume that a homogeneous 
cylinder of radius R and mass m rolls down an inclined plane 
(Fig. 5.22) without slipping. The angle of inclination of the plane 
is $\beta$ and its height is h ($h \gg R$). The initial velocity of the cylinder
is zero. We are to find the velocity of the centre of mass and the 
angular velocity of the cylinder at the moment when it reaches 
the horizontal section. We shall give two variants of the solution. 

First Variant. The cylinder will move under the action of three
forces: the force $\mathbf{P} = m\mathbf{g}$, the force of friction
$\mathbf{F}_{fr}$, and the force of normal pressure $\mathbf{F}_n$
(see Sec. 2.12). The acceleration of the cylinder in the direction of
a normal to the plane is zero. Consequently, the magnitude of the
force of normal pressure equals the normal component of the force
$\mathbf{P}$, having the magnitude $mg \cos \beta$.

Friction appears between the cylinder and the plane at the points 
of their contact. In the absence of slipping, these points of the 
cylinder are stationary (they form an instantaneous axis of rota- 
tion). Hence, the force of friction we are dealing with is a static 
force of friction. We know from Sec. 2.10 that the static force of
friction can range from zero to the maximum value $F_0$ that is
determined by the product of the coefficient of friction f and the
force of normal pressure pressing the contacting bodies against each
other ($F_0 = f mg \cos \beta$). In the case under consideration, the
force of friction takes on a value such that slipping will be absent.
Slipping will be absent when the cylinder rolls along the plane
provided that the linear velocity of the points of contact vanishes.
This will occur, in turn, if the velocity of the centre of mass $v_C$
at each moment of time equals the angular velocity of rotation of the
cylinder $\omega$ multiplied by the radius of the cylinder R:

    $v_C = \omega R$    (5.61)

The acceleration of the centre of mass $a_C$ will accordingly equal
the angular acceleration $\alpha$ multiplied by R:

    $a_C = \alpha R$    (5.62)

If the force of friction needed to obey conditions (5.61) and (5.62)
does not exceed the maximum value $F_0$, then the cylinder will roll
down the plane without slipping. Otherwise rolling without slipping
is impossible.

Equation (5.6) in the given case has the form

    $m \mathbf{a}_C = m \mathbf{g} + \mathbf{F}_{fr} + \mathbf{F}_n$

Projecting it onto the direction of motion, we get

    $m a_C = mg \sin \beta - F_{fr}$    (5.63)

For a homogeneous cylinder rotating about an axis of symmetry,
$\mathbf{L} = I \boldsymbol{\omega}$. Therefore, Eq. (3.118) can be
written in the form

    $I \alpha = \sum M_z$    (5.64)

where z is the axis of the cylinder [see Eq. (5.15)]. In Eq. (5.64)
written relative to the axis of the cylinder, only the moment of the
force of friction will differ from zero. The remaining forces including
the resultant of the forces of inertia are directed through the axis
of the cylinder. As a result, their moments relative to this axis
equal zero. Thus, Eq. (5.64) will be written as follows:

    $I \alpha = R F_{fr}$    (5.65)

Here I is the moment of inertia of the cylinder relative to its axis,
equal to $mR^2/2$.

Equations (5.63) and (5.65) contain three unknown quantities,
$F_{fr}$, $a_C$ and $\alpha$. The last two of them are related by
Eq. (5.62) resulting from the absence of slipping. By solving the
system of equations (5.62), (5.63), and (5.65), we shall find (with
account of the fact that $I = mR^2/2$) the values of the required
quantities:

    $F_{fr} = \tfrac{1}{3} mg \sin \beta$    (5.66)

    $a_C = \tfrac{2}{3} g \sin \beta$    (5.67)

    $\alpha = \tfrac{2}{3} \frac{g}{R} \sin \beta$    (5.68)


Now that we know the value of the static force of friction needed
for rolling down of the cylinder without slipping, we can find the
condition at which this rolling is possible. For the cylinder to roll
down without slipping, the force (5.66) must not exceed the maximum
value of the static force of friction $F_0$ equal to $f mg \cos \beta$:

    $\tfrac{1}{3} mg \sin \beta \leq f mg \cos \beta$

whence

    $\tan \beta \leq 3f$

Consequently, if the slope ($\tan \beta$) of the plane exceeds three
times the static coefficient of friction between the cylinder and
the plane, rolling down cannot occur without slipping.
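The solution of the system (5.62), (5.63), (5.65) and the no-slip condition can be reproduced numerically. The sketch below is one possible way of doing so, not a definitive procedure: it eliminates the force of friction by hand and assumes g = 9.81 m/s^2, and the function names, the mass, the radius and the angles are hypothetical.

```python
import math

def rolling_cylinder(m, R, beta, g=9.81):
    """Solve m*a_C = m*g*sin(beta) - F_fr and I*alpha = R*F_fr with
    a_C = alpha*R for a homogeneous solid cylinder (I = m*R**2/2)."""
    I = 0.5 * m * R * R
    # From I*(a_C/R) = R*F_fr it follows that F_fr = I*a_C/R**2
    a_c = m * g * math.sin(beta) / (m + I / R ** 2)
    f_fr = I * a_c / R ** 2
    alpha = a_c / R
    return f_fr, a_c, alpha

def rolls_without_slipping(beta, f):
    """No-slip condition tan(beta) <= 3*f for the solid cylinder."""
    return math.tan(beta) <= 3 * f

m, R, beta, g = 2.0, 0.1, math.radians(20), 9.81
f_fr, a_c, alpha = rolling_cylinder(m, R, beta, g)

print("F_fr =", f_fr, " (1/3) m g sin(beta) =", m * g * math.sin(beta) / 3)
print("a_C  =", a_c, " (2/3) g sin(beta)   =", 2 * g * math.sin(beta) / 3)
print("alpha =", alpha)
print(rolls_without_slipping(math.radians(30), 0.25))
print(rolls_without_slipping(math.radians(45), 0.20))
```

The computed values reproduce Eqs. (5.66) to (5.68). The acceleration does not change when m or R is varied, which is the property noted later in the text for the time of rolling down.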

From the constancy of $a_C$ [see Eq. (5.67)] it follows that the
centre of mass of the cylinder moves with uniform acceleration.
During the time $t_r$ that it rolls down, the cylinder travels the
distance $h/\sin \beta$. In uniformly accelerated motion from rest, the
distance, acceleration, and time are related by the equation
$s = at^2/2$. Introducing the value of s, we get

    $\frac{h}{\sin \beta} = \frac{a_C t_r^2}{2}$

whence, introducing the value of $a_C$ from Eq. (5.67), we have

    $t_r = \frac{1}{\sin \beta} \sqrt{\frac{3h}{g}}$

This time, like $a_C$, does not depend on the mass and radius of the
cylinder*. It is determined only by the angle of inclination of the
plane $\beta$ and the difference between the levels of its edges h.

The velocity of the centre of mass when the cylinder reaches the
horizontal section will be

    $v_C = a_C t_r = \sqrt{\tfrac{4}{3} g h}$

and the angular velocity of the cylinder will be

    $\omega = \frac{v_C}{R} = \frac{1}{R} \sqrt{\tfrac{4}{3} g h}$

* This holds only for a homogeneous solid cylinder.

We must note that the static force of friction does no work on the 
cylinder because the points of the cylinder which this force is applied 
to are stationary at each moment of time [see Eq. (3.16)]. 

We find for the horizontal plane ($\beta = 0$) by Eqs. (5.67) and
(5.68) that the cylinder will travel without acceleration if it is
first imparted a certain translational velocity and the corresponding
(such that no slipping occurs) angular velocity. In reality, the
motion will be retarded. This is due to the force of rolling friction,
which is directed so that its moment reduces the angular velocity
$\omega$, while the force itself produces a corresponding (again such
that no slipping will appear) retardation of the centre of mass. The
force of rolling friction does negative work on a rolling body.

In solving the problem on the rolling of a cylinder down an in- 
clined plane, we disregarded rolling friction. 

Second Variant. Since the force of friction does no work (we
disregard rolling friction), the total energy of the cylinder remains
constant. At the initial moment, the kinetic energy is zero, and the
potential energy is mgh. At the bottom of the inclined plane, the
potential energy vanishes but a kinetic energy appears equal to
[see Eq. (5.55)]:

    $E_k = \tfrac{1}{2} m v_C^2 + \tfrac{1}{2} I_C \omega^2$

Since slipping is absent, $v_C$ and $\omega$ are related by the
expression $v_C = \omega R$. Introducing $\omega = v_C/R$ and
$I_C = mR^2/2$ into the expression for the kinetic energy, we get

    $E_k = \tfrac{1}{2} m v_C^2 + \tfrac{1}{2} \cdot \frac{mR^2}{2} \cdot \frac{v_C^2}{R^2} = \tfrac{3}{4} m v_C^2$

The total energy at the beginning and end of rolling down the
inclined plane must be the same:

    $\tfrac{3}{4} m v_C^2 = mgh$

whence

    $v_C = \sqrt{\tfrac{4}{3} g h}$

and the angular velocity is

    $\omega = \frac{v_C}{R} = \frac{1}{R} \sqrt{\tfrac{4}{3} g h}$
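The agreement between the two variants can be checked with a few lines of code. This sketch assumes a homogeneous solid cylinder starting from rest, g = 9.81 m/s^2, and illustrative values of the height, the angle and the radius. The function names are hypothetical.

```python
import math

def final_speed_dynamics(h, beta, g=9.81):
    """First variant: a_C = (2/3) g sin(beta), path h/sin(beta), start from rest."""
    a_c = 2 * g * math.sin(beta) / 3
    path = h / math.sin(beta)
    t_r = math.sqrt(2 * path / a_c)     # from s = a * t**2 / 2
    return a_c * t_r, t_r

def final_speed_energy(h, g=9.81):
    """Second variant: (3/4) m v_C**2 = m g h."""
    return math.sqrt(4 * g * h / 3)

h, beta, R, g = 0.5, math.radians(25), 0.05, 9.81

v_dyn, t_r = final_speed_dynamics(h, beta, g)
v_en = final_speed_energy(h, g)
omega = v_en / R                        # no slipping: v_C = omega * R

print("t_r =", t_r, " expected:", math.sqrt(3 * h / g) / math.sin(beta))
print("v_C (dynamics) =", v_dyn)
print("v_C (energy)   =", v_en)
print("omega =", omega)
```

Both routes give the same velocity of the centre of mass, and it does not depend on the angle of inclination.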

Pay attention to how much simpler the second variant of solution
is than the first one.

5.9. Gyroscopes

A gyroscope (or top) is a massive symmetrical body rotating with
a high angular velocity about an axis of symmetry. We shall call this
axis the axis of the gyroscope. It is one of the principal axes of
inertia. Therefore, if it does not turn in space, the angular momentum
is $\mathbf{L} = I \boldsymbol{\omega}$, where I is the moment of
inertia relative to the gyroscope axis. Let us now assume that the
gyroscope axis rotates with a certain velocity $\boldsymbol{\omega}'$.
In this case, the resultant rotation of the gyroscope occurs about an
axis not coinciding with an axis of symmetry, and the direction of the
vector $\mathbf{L}$ does not coincide with that of the gyroscope axis.
If the angular velocity $\omega'$ of the axis is negligibly small in
comparison with the angular velocity $\omega$ of the gyroscope itself,
however ($\omega' \ll \omega$), then we may assume that the vector
$\mathbf{L}$ is approximately equal to $I \boldsymbol{\omega}$ and is
directed along the gyroscope axis. In this condition, rotation of the
vector $\mathbf{L}$ and rotation of the gyroscope axis will be
equivalent. We shall assume in the following that the condition
$\omega' \ll \omega$ is obeyed.

Fig. 5.23    Fig. 5.24

When an attempt is made to turn the gyroscope axis, a distinctive
phenomenon is observed called the gyroscopic effect: under the
action of forces that ought to cause rotation of the gyroscope axis OO
about straight line O'O' (Fig. 5.23), the axis turns about straight
line O"O" (axis OO and straight line O'O' are in the plane of the
drawing, and straight line O"O" and the forces $\mathbf{F}_1$ and
$\mathbf{F}_2$ are at right angles to this plane). The behaviour of the
gyroscope, which seems unnatural at first sight, completely conforms
with the laws of rotational dynamics. Indeed, the moment of the forces
$\mathbf{F}_1$ and $\mathbf{F}_2$ is directed along straight line O'O'.
During the time dt, the angular momentum of the gyroscope $\mathbf{L}$
receives the increment $d\mathbf{L} = \mathbf{M} \, dt$, which has the
same direction as $\mathbf{M}$. After the time dt elapses, the angular
momentum of the gyroscope will equal the resultant
$\mathbf{L}' = \mathbf{L} + d\mathbf{L}$ in the plane of the figure.
The direction of the vector $\mathbf{L}'$ coincides with the new
direction of the gyroscope axis. Thus, the latter will turn about
straight line O"O" through a certain angle $d\varphi$. It can be seen
from Fig. 5.23 that $d\varphi = |d\mathbf{L}|/L = M \, dt/L$. Hence it
follows that the gyroscope axis turned to its new position with the
angular velocity $\omega' = d\varphi/dt = M/L$. Let us write this
relation in the form $M = \omega' L$. The vectors $\mathbf{M}$,
$\mathbf{L}$, and $\boldsymbol{\omega}'$ are mutually perpendicular
(the vector $\boldsymbol{\omega}'$ is directed along straight line O"O"
toward us). The relation between them can therefore be written
in the form

    $\mathbf{M} = [\boldsymbol{\omega}' \mathbf{L}]$    (5.69)

We have obtained this equation for the case when the vectors
$\boldsymbol{\omega}'$ and $\mathbf{L}$ are mutually perpendicular. It
also holds, however, in the most general case. A glance at Fig. 5.24
shows that when the gyroscope axis turns about the vector
$\boldsymbol{\omega}'$ through the angle $d\varphi$ the vector
$\mathbf{L}$ receives an increment whose magnitude is
$|d\mathbf{L}| = L \sin \alpha \, d\varphi$. At the same time
$|d\mathbf{L}| = M \, dt$. Thus, $L \sin \alpha \, d\varphi = M \, dt$,
whence $M = \omega' L \sin \alpha$. It is easy to see with the aid of
Fig. 5.24 that in this case Eq. (5.69) holds (the vectors
$\boldsymbol{\omega}'$ and $\mathbf{L}$ are in the plane of the figure,
the vector $d\mathbf{L}$ is directed beyond the drawing and is
therefore depicted by a circle with a cross in it). We remind our
reader that Eq. (5.69) is correct only if $\omega' \ll \omega$.
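Equation (5.69) can be checked numerically for the general case of Fig. 5.24. The sketch below assumes that the vector L of constant magnitude is turned about the z-axis, which plays the part of the direction of the angular velocity of the axis, through the small angle equal to that velocity multiplied by dt. It then compares the ratio of the increment of L to dt with the vector product of the two vectors. The magnitudes, the angle and the time step are arbitrary illustrative values.

```python
import math

def cross(u, v):
    """Vector product [u, v] of two 3-component sequences."""
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def rotate_about_z(vec, angle):
    """Turn a vector through the given angle about the z-axis."""
    c, s = math.cos(angle), math.sin(angle)
    return [c * vec[0] - s * vec[1], s * vec[0] + c * vec[1], vec[2]]

L_mag = 5.0                    # magnitude of the angular momentum
alpha = math.radians(40)       # angle between L and the axis of precession
omega_p = 0.3                  # angular velocity of the gyroscope axis
dt = 1e-6

L = [L_mag * math.sin(alpha), 0.0, L_mag * math.cos(alpha)]
omega_prime = [0.0, 0.0, omega_p]

L_new = rotate_about_z(L, omega_p * dt)
dL_dt = [(new - old) / dt for new, old in zip(L_new, L)]
M = cross(omega_prime, L)      # right-hand side of Eq. (5.69)
M_mag = math.sqrt(sum(c * c for c in M))

print("dL/dt      =", dL_dt)
print("[omega' L] =", M)
print("|M| =", M_mag, " omega' L sin(alpha) =", omega_p * L_mag * math.sin(alpha))
```

The rate of change of L agrees with the vector product to within the error of the finite time step, and its magnitude equals $\omega' L \sin \alpha$.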

When attempts are made to cause the axis of a gyroscope to turn
in a given way, the so-called gyroscopic forces are set up owing
to the gyroscopic effect. These forces act on the bearings in which
the gyroscope axis rotates. For example, if gyroscope axis OO is
forcibly turned about straight line O'O' (Fig. 5.25), the gyroscope
axis tends to turn about straight line O"O". To prevent this
rotation, the forces $\mathbf{F}'_1$ and $\mathbf{F}'_2$ acting from the
side of the bearings must be applied to the gyroscope axis. According
to Newton's third law, the gyroscope axis will act on the bearings
with the forces $\mathbf{F}_1$ and $\mathbf{F}_2$, and the latter are
exactly the gyroscopic forces. Upon forced turning of the gyroscope
axis with the angular velocity $\boldsymbol{\omega}'$, the moment
of the forces with which the bearings act on the axis is determined
by Eq. (5.69). The moment of the gyroscopic forces with which
the axis acts on the bearings is

    $\mathbf{M}' = [\mathbf{L} \boldsymbol{\omega}']$    (5.70)

Let us assume that the axis of a gyroscope is fixed in ring R that
can freely turn in frame Fr (Fig. 5.26). Let us turn the frame about
an axis in its plane with the angular velocity $\boldsymbol{\omega}'$.
In this case, as we have found out, a moment of gyroscopic forces
determined by Eq. (5.70) is produced that acts on the ring. This
moment will cause the ring to turn in the frame in the direction
indicated by the arrow until the gyroscope axis becomes arranged in
the direction of the axis of rotation of the frame and the moment
(5.70) vanishes. The direction of rotation of the gyroscope itself and
the direction in which the frame turns will coincide. When
$\mathbf{L}$ and $\boldsymbol{\omega}'$ are directed oppositely, the
moment (5.70) also vanishes. The corresponding position of the
gyroscope axis, however, will be unstable: upon the slightest
deviation of the angle between $\mathbf{L}$ and $\boldsymbol{\omega}'$
from 180 degrees, the moment $\mathbf{M}'$ will be set up that will
turn the axis until this angle becomes equal to zero.

Fig. 5.25

Now let us assume that the frame turns with the angular velocity
$\boldsymbol{\omega}'$ about an axis not in its plane (Fig. 5.27). In
the position of the ring at which the angular momentum of the
gyroscope $\mathbf{L}$ is perpendicular to $\boldsymbol{\omega}'$
(Fig. 5.27a), the vector $\mathbf{M}'$ has the direction shown in the
figure. The component $\mathbf{M}'_\perp$ of this vector causes the ring
to turn in the frame, and as a result the angle between the vectors
$\mathbf{L}$ and $\boldsymbol{\omega}'$ will diminish. The component
$\mathbf{M}'_\parallel$ tends to misalign the ring relative to the
frame. When the ring occupies a position such that the angle between
the vectors $\mathbf{L}$ and $\boldsymbol{\omega}'$ takes on the
smallest possible value (Fig. 5.27b), the component
$\mathbf{M}'_\perp$ will vanish because in this case the moment of the
gyroscopic forces $\mathbf{M}'$ is in the plane of the ring; this
moment cannot produce rotation of the ring in the frame. Thus, under
the action of gyroscopic forces, the ring occupies such a position in
the frame in which the angle between the gyroscope axis and the axis
of rotation of the frame is minimum.

Fig. 5.27

An instrument called the gyrocompass (gyroscopic compass) is
based on the behaviour of a gyroscope described above. This
instrument is a gyroscope whose axis can freely turn in a horizontal
plane. The Earth's daily rotation causes the axis of the gyrocompass
to arrange itself in a position such that the angle between this axis
and the Earth's axis of rotation will be minimum (Fig. 5.28). In
this position, the axis of the gyrocompass will be in a meridian
plane and, consequently, line up in a north-south direction. A
gyrocompass has the advantage over its magnetic pointer counterpart
that no corrections have to be introduced into its readings for the
so-called magnetic declination (the angle between the magnetic and
the geographic meridians). Another advantage is that no measures have
to be taken to compensate for the action on the pointer of
ferromagnetic objects near the compass (for example, the steel hull
of a ship).

Assume that the axis of a gyroscope can freely turn about point O 
(Fig. 5.29). Let us consider the behaviour of such a gyroscope in the 
field of forces of gravity. The magnitude of the moment of the forces 
applied to the gyroscope is 

$M = mgl \sin\alpha$ (5.71)

where $m$ = mass of the gyroscope

$l$ = distance from point O to the centre of mass of the gyroscope

$\alpha$ = angle made by the gyroscope axis with a vertical line.

The vector $\mathbf{M}$ is perpendicular to the vertical plane passing through
the gyroscope axis (this plane is hatched in Fig. 5.29).

Under the action of the moment $\mathbf{M}$, the angular momentum $\mathbf{L}$
changes during the time $dt$ by the increment $d\mathbf{L} = \mathbf{M}\,dt$
perpendicular to the vector $\mathbf{L}$. The change in the vector $\mathbf{L}$
upon receiving the increment $d\mathbf{L}$ corresponds to a turning of the
gyroscope axis such that the angle $\alpha$ does not change. The vertical plane
passing through the gyroscope axis turns through the angle $d\varphi$.




The vector M turns through the same angle in a horizontal plane. 
As a result, when the time dt elapses, the vectors L and M will have 
the same mutual arrangement as at the initial moment. 

During the next moment $dt$, the vector $\mathbf{L}$ again receives the
increment $d\mathbf{L}$ that is perpendicular to the new direction of the vector $\mathbf{L}$
setting in after the preceding elementary turn, etc. As a result, the
gyroscope axis will rotate about the vertical axis passing through
point O with the angular velocity $\omega'$. It will describe a cone with an
apex angle of $2\alpha$ (compare with Fig. 5.24). (When $\alpha = \pi/2$, the cone
degenerates into a plane.) The vector $\mathbf{L}$ will change only in direction.
Its magnitude will be constant because the elementary increments
$d\mathbf{L}$ will always be perpendicular to the vector $\mathbf{L}$*.

* We can find similar behaviour in the velocity vector when a point moves
uniformly along a circle. The vector $\mathbf{v}$ receives the increment
$d\mathbf{v} = \mathbf{a}_n\,dt$ ($a_n$ = const) during the time $dt$. As a result, the
direction of the vector $\mathbf{v}$ changes, while its magnitude remains constant.

Thus, in the field of forces of gravity, the axis of a gyroscope with
a fixed point rotates about a vertical line, describing a cone. Such
motion of a gyroscope is called precession. We can find the angular
velocity $\omega'$ of precession if we take into account that by Eq. (5.69)
$M = \omega' L \sin\alpha$. Equating this value to Eq. (5.71), we get
$\omega' L \sin\alpha = mgl \sin\alpha$, whence

$\omega' = \dfrac{mgl}{L} = \dfrac{mgl}{I\omega}$ (5.72)


It follows from Eq. (5.72) that the angular velocity of precession does
not depend on the angle of inclination of the gyroscope axis with respect
to a vertical line (on the angle $\alpha$).
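The independence of the precession rate from the tilt angle can be checked numerically. The following is a minimal sketch, assuming a hypothetical gyroscope (the mass, arm, moment of inertia and spin rate are illustrative values, not taken from the text) and the relation $L = I\omega$ for a rapidly spinning gyroscope. It computes the moment from Eq. (5.71), divides it by $L \sin\alpha$ as Eq. (5.69) requires, and compares the result with Eq. (5.72).

```python
import math

G_ACC = 9.8  # acceleration of free fall, m/s^2 (assumed round value)


def gravity_moment(m, l, alpha):
    """Magnitude of the moment of gravity, Eq. (5.71): M = m g l sin(alpha)."""
    return m * G_ACC * l * math.sin(alpha)


def precession_from_moment(m, l, inertia, omega, alpha):
    """Precession rate obtained from M = omega' L sin(alpha), Eq. (5.69)."""
    ang_momentum = inertia * omega  # L = I * omega (assumption: fast spin)
    return gravity_moment(m, l, alpha) / (ang_momentum * math.sin(alpha))


def precession_rate(m, l, inertia, omega):
    """Precession rate from Eq. (5.72): omega' = m g l / (I omega)."""
    return m * G_ACC * l / (inertia * omega)


if __name__ == "__main__":
    # Hypothetical gyroscope: 0.5 kg, arm 5 cm, I = 1e-3 kg*m^2, 300 rad/s.
    m, l, inertia, omega = 0.5, 0.05, 1.0e-3, 300.0
    for alpha_deg in (20, 45, 90):
        rate = precession_from_moment(m, l, inertia, omega, math.radians(alpha_deg))
        print(f"alpha = {alpha_deg:2d} deg, omega' = {rate:.4f} rad/s")
    print(f"Eq. (5.72): omega' = {precession_rate(m, l, inertia, omega):.4f} rad/s")
```

The factor $\sin\alpha$ cancels for every tilt angle, so all printed rates coincide.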


Fig. 5.30 (a), (b), (c)

We have considered the approximate theory of the gyroscope.
According to the strict theory, rotation of the axis about a vertical
line is accompanied by oscillations of the axis in a vertical plane.
The latter are attended by changes in the angle $\alpha$ ranging from
$\alpha_1$ to $\alpha_2$. This wobbling of the axis is called nutation. Depending
on the initial conditions, the end of a gyroscope axis draws one of the
curves depicted in Fig. 5.30 on an imaginary spherical surface. If, for
example, after positioning the axis at the angle $\alpha_1$, we make the
gyroscope rotate and then release the axis gently, the latter will first
lower while rotating about the vertical line. After reaching the angle
$\alpha_2$, the axis will begin to rise, and so on (this case is shown in
Fig. 5.30b).

By imparting an initial impetus of a quite 
definite magnitude and direction to a gyro- 
scope, we can achieve precession of its axis 
without nutation. Such precession is defined as regular. The ampli- 
tude of nutation diminishes with an increasing gyroscope velocity 
of rotation. Nutation is also absorbed by friction in the support. 
This is why nutation is often unnoticeable in practice. Precession
that is only approximately regular is called pseudoregular.

If point O is at the centre of mass of a gyroscope (see Fig. 5.29),
the moment of the force of gravity becomes equal to zero, and we
get the so-called free symmetrical top. Owing to the law of
conservation, the angular momentum of such a top will change neither
in magnitude nor in direction. If we rotate the top about its axis
of symmetry, the vectors $\mathbf{L}$ and $\boldsymbol{\omega}$ will have the same direction
which remains constant for an infinitely long time. If, however,
the top is rotated about an axis not coinciding with any of its
principal axes of inertia, the vectors $\mathbf{L}$ and $\boldsymbol{\omega}$ will not coincide
(Fig. 5.31). The relevant calculations give us the following results.
The vector $\boldsymbol{\omega}$ remains constant in magnitude and precesses about the
direction of the vector $\mathbf{L}$, describing a cone. At the same time, the axis of
symmetry $z$ of the top precesses, the vectors $\mathbf{L}$ and $\boldsymbol{\omega}$ and the z-axis
constantly being in one plane. The top rotates about the z-axis
with the angular velocity $\omega_z = L_z/I_z$, where $L_z$ is the projection
of the vector $\mathbf{L}$ onto the z-axis, and $I_z$ is the moment of inertia of
the top relative to this axis. The angular velocity of precession
is $\omega_{pr} = L/I$, where $I$ is the identical value of the moments of
inertia $I_x$ and $I_y$.



CHAPTER 6 UNIVERSAL GRAVITATION


6.1. Law of Universal Gravitation 

All bodies in nature mutually attract one another. The law which 
this attraction obeys was established by Newton and is called the 
law of universal gravitation. This law states: the force with which
two point particles attract each other is proportional to their masses
and inversely proportional to the square of the distance between them:

$F = G \dfrac{m_1 m_2}{r^2}$ (6.1)

Here G is a constant of proportionality called the gravitational 
constant. The force is directed along the straight line passing 
through the interacting particles (Fig. 6.1). 



The force with which the second particle attracts the first one 
can be written in the vector form as follows: 

$\mathbf{F}_{12} = G \dfrac{m_1 m_2}{r^2}\,\mathbf{e}_{12}$ (6.2)

The symbol $\mathbf{e}_{12}$ stands for a unit vector directed from the first
particle to the second one (see Fig. 6.1). Substituting the vector $\mathbf{e}_{21}$
for the vector $\mathbf{e}_{12}$ in Eq. (6.2), we get the force $\mathbf{F}_{21}$ acting on the
second particle.

To find the force of interaction of extended bodies, they must
be divided into elementary masses $\Delta m$, each of which can be
assumed to be a point particle (Fig. 6.2). According to Eq. (6.2), the
i-th elementary mass of body 1 is attracted to the k-th elementary
mass of body 2 with the force

$\mathbf{F}_{ik} = G \dfrac{\Delta m_i\,\Delta m_k}{r_{ik}^2}\,\mathbf{e}_{ik}$ (6.3)

where $r_{ik}$ is the distance between the elementary masses.





Summation of Eq. (6.3) over all the values of the subscript k
gives the force exerted by body 2 on the elementary mass $\Delta m_i$
belonging to body 1:

$\mathbf{F}_i = \sum_k G \dfrac{\Delta m_i\,\Delta m_k}{r_{ik}^2}\,\mathbf{e}_{ik}$ (6.4)

Finally, summation of Eq. (6.4) over all the values of the subscript i,
i.e. summation of the forces applied to all the elementary masses
of the first body, gives the force exerted by body 2 on body 1:

$\mathbf{F}_{12} = \sum_i \sum_k G \dfrac{\Delta m_i\,\Delta m_k}{r_{ik}^2}\,\mathbf{e}_{ik}$ (6.5)


Summation is performed over all the values of the subscripts i
and k. Consequently, if body 1 is divided into $N_1$, and body 2 into
$N_2$ elementary masses, then the sum (6.5) will contain $N_1 N_2$ addends.

In practice, the summation according to Eq. (6.5) consists in 
integration and, generally speaking, is a very complicated mathe- 
matical problem. If the interacting bodies are homogeneous and 
have a regular shape, the calculations are greatly simplified. In 
particular, when the interacting bodies are homogeneous* spheres,
calculation by Eq. (6.5) leads to Eq. (6.2), $m_1$ and $m_2$ now being
the masses of the spheres, $r$ the distance between their centres,
and e 12 a unit vector directed from the centre of the first sphere 
to that of the second one. The spheres thus interact like point par- 
ticles of masses equal to those of the spheres and situated at their 
centres. 

If one of the bodies is a homogeneous sphere of a very great radius 
(for example, the Earth), while the second body can be considered 
as a point particle, then their interaction is described by Eq. (6.2) 
in which r stands for the distance from the centre of the sphere to 
the particle (this statement will be proved in the following section). 
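The summation over elementary masses described above can be tried out numerically. The sketch below is an illustration under stated assumptions, not the procedure used in the text: it splits a homogeneous sphere into equal point masses on a cubic grid (the grid, the function names and the test values are hypothetical), sums the contributions of the form of Eq. (6.3) acting on an external particle, and compares the result with Eq. (6.2) applied to a point mass at the centre of the sphere.

```python
import math

G = 6.670e-11  # gravitational constant quoted in the text, N*m^2/kg^2


def sphere_elements(mass, radius, n):
    """Split a homogeneous sphere centred at the origin into equal point
    masses at the centres of the cells of an n x n x n cubic grid.
    Cells whose centres lie outside the sphere are discarded."""
    h = 2.0 * radius / n
    points = []
    for i in range(n):
        x = -radius + (i + 0.5) * h
        for j in range(n):
            y = -radius + (j + 0.5) * h
            for k in range(n):
                z = -radius + (k + 0.5) * h
                if x * x + y * y + z * z <= radius * radius:
                    points.append((x, y, z))
    return points, mass / len(points)


def force_on_particle(points, dm, m, pos):
    """Vector sum of the forces exerted on a particle of mass m at pos
    by all the elementary masses dm (each term has the form of Eq. 6.3)."""
    fx = fy = fz = 0.0
    for (x, y, z) in points:
        dx, dy, dz = x - pos[0], y - pos[1], z - pos[2]
        r2 = dx * dx + dy * dy + dz * dz
        f_over_r = G * dm * m / (r2 * math.sqrt(r2))
        fx += f_over_r * dx
        fy += f_over_r * dy
        fz += f_over_r * dz
    return fx, fy, fz


if __name__ == "__main__":
    # Hypothetical values: a 1000 kg sphere of radius 1 m, a 1 kg particle 3 m away.
    points, dm = sphere_elements(1000.0, 1.0, 20)
    fx, fy, fz = force_on_particle(points, dm, 1.0, (3.0, 0.0, 0.0))
    print("summed force, x-component:", fx)
    print("point-mass value         :", -G * 1000.0 * 1.0 / 3.0 ** 2)
```

The x-component is negative because the particle is pulled toward the sphere at the origin. A finer grid brings the discretised sphere closer to a truly homogeneous one.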

The dimension of the gravitational constant in accordance with
Eq. (6.1) is

$[G] = \dfrac{[F]\,[r^2]}{[m^2]} = \dfrac{(ML/T^2)\,L^2}{M^2} = L^3 M^{-1} T^{-2}$


The numerical value of G was determined by measuring the force
with which two bodies of known mass attract each other. Great
difficulties appear in such measurements because the forces of
attraction are extremely small for bodies whose masses can be
measured directly. For example, two bodies each having a mass of
100 kg and one metre apart interact with a force of the order of
$10^{-6}$ N, i.e. about $10^{-4}$ gf.

* It is sufficient for the distribution of the mass within the limits of each
sphere to have central symmetry, i.e. for the density to be a function only of the
distance from the centre of the sphere.

The first successful attempt to determine G was its measurement
carried out by Henry Cavendish (1731-1810) in 1798. He used the very
sensitive torsion balance method (Fig. 6.3). Two lead spheres m (each
of mass 0.729 kg) fastened to the ends of a light rod were placed near
symmetrically arranged spheres M (each of mass 158 kg). The rod was
suspended on an elastic torsion fibre. Twisting of the latter
was measured, and its magnitude showed the force of attraction
between the spheres. The top end of the fibre was fastened in an
adjusting head whose turning made it possible to change the distance
between the spheres m and M. The value

$G = 6.670 \times 10^{-11}\ \mathrm{m^3/(kg \cdot s^2)}$ (or $\mathrm{N \cdot m^2/kg^2}$)

is considered to be the most accurate of all the values determined
in different ways.

Fig. 6.3

If in Eq. (6.1) we assume that $m_1$, $m_2$, and $r$ equal unity, then
the force numerically equals G. Thus, two spheres each having
a mass of 1 kg whose centres are 1 m apart attract each other with
a force of $6.670 \times 10^{-11}$ N.
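The two numerical examples quoted in this section follow directly from Eq. (6.1). A minimal sketch, using the value of G given in the text; the function name is a hypothetical label, and the conversion factor from newtons to gram-force is a standard value not stated in the text.

```python
G = 6.670e-11             # gravitational constant quoted in the text, N*m^2/kg^2
NEWTON_PER_GF = 9.80665e-3  # 1 gram-force in newtons (standard value, assumed)


def gravitational_force(m1, m2, r):
    """Magnitude of the attraction of two point particles, Eq. (6.1)."""
    return G * m1 * m2 / r ** 2


if __name__ == "__main__":
    f_100 = gravitational_force(100.0, 100.0, 1.0)
    print("two 100 kg bodies, 1 m apart:", f_100, "N =", f_100 / NEWTON_PER_GF, "gf")
    print("two 1 kg spheres, 1 m apart :", gravitational_force(1.0, 1.0, 1.0), "N")
```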


6.2. Gravitational Field 

Gravitational interaction is carried out through a gravitational
field. Every body changes the properties of the space surrounding
it: it sets up a gravitational field in this space. The field manifests
itself in that another body placed in it experiences a force. The
“intensity” of a gravitational field can obviously be assessed according
to the magnitude of the force acting at a given point on a body of
unit mass. Accordingly, the quantity

$\mathbf{g}' = \dfrac{\mathbf{F}}{m}$ (6.6)

is called the gravitational intensity, or the gravitational field vector.
In Eq. (6.6), $\mathbf{F}$ is the gravitational force acting on a point particle
of mass $m$ at a given point of the field.

The dimension of $g'$ coincides with that of acceleration. The
intensity of the gravitational field near the Earth’s surface equals
the acceleration of free fall g (with an accuracy up to the correction
due to the Earth’s rotation, see Sec. 4.2).

It is easy to conclude from Eq. (6.2) that the intensity of the
field set up by a point particle of mass $m$ is

$\mathbf{g}' = -G \dfrac{m}{r^2}\,\mathbf{e}_r$ (6.7)

where e r is the unit vector of the position vector drawn from the 
particle to the given point of the field, and r is the magnitude of 
this position vector. 

Assume that a gravitational field is produced by a point particle 
of mass m fixed at the origin of coordinates. Hence, the following 
force will act on a particle of mass m' at a point with the position 
vector r: 

$\mathbf{F} = \mathbf{g}' m' = -G \dfrac{m m'}{r^2}\,\mathbf{e}_r$ (6.8)

[compare with Eq. (3.120)]. We showed in Sec. 3.13 that the
potential energy of the particle $m'$ is determined in this case by the
equation

$E_p = -G \dfrac{m m'}{r}$ (6.9)

(the potential energy is assumed to vanish when $r = \infty$). Equation
(6.9) can also be interpreted as the mutual potential energy of the
point particles $m$ and $m'$.

Inspection of Eq. (6.9) shows that to each point of the field prod- 
uced by the particle m there corresponds a definite value of the 
potential energy which the particle m' has in this field. Consequently, 
the field can be characterized by the potential energy which a
particle of mass $m' = 1$ has at the given point. The quantity

$\varphi = \dfrac{E_p}{m'}$ (6.10)

is called the potential of the gravitational field. In this equation,
$E_p$ is the potential energy which a point particle of mass $m'$ has
at a given point of the field.

Knowing the potential of a field, we can calculate the work done 
on the particle m' by the forces of the field when moving it from 
position 1 to position 2. According to Eq. (3.30), this work is

$A_{12} = E_{p1} - E_{p2} = m'(\varphi_1 - \varphi_2)$ (6.11)

According to Eqs. (6.6) and (6.10), the force acting on the particle
$m'$ is $\mathbf{F} = m'\mathbf{g}'$, and the potential energy of this particle is
$E_p = m'\varphi$. By Eq. (3.31), we have $\mathbf{F} = -\nabla E_p$, i.e.
$m'\mathbf{g}' = -\nabla(m'\varphi)$.
Putting the constant $m'$ outside the gradient sign and then cancelling
this constant, we arrive at a relation between the intensity and
potential of a gravitational field:

$\mathbf{g}' = -\nabla\varphi$ (6.12)
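The relation between intensity and potential can be verified numerically for the field of a point particle. The sketch below is illustrative only: it takes the potential $\varphi = -Gm/r$ that follows from Eqs. (6.9) and (6.10), differentiates it by central differences (a numerical technique chosen here for the check, not part of the text), and compares the result with the intensity of Eq. (6.7). The mass and the field point are hypothetical test values.

```python
import math

G = 6.670e-11  # gravitational constant quoted in the text, N*m^2/kg^2


def potential(m, pos):
    """Potential of a point particle at the origin: phi = -G m / r."""
    r = math.sqrt(sum(c * c for c in pos))
    return -G * m / r


def intensity(m, pos):
    """Field intensity of a point particle, Eq. (6.7): g' = -G m e_r / r^2."""
    r = math.sqrt(sum(c * c for c in pos))
    return tuple(-G * m * c / r ** 3 for c in pos)


def minus_gradient(m, pos, h=10.0):
    """Minus the gradient of the potential by central differences with step h."""
    result = []
    for i in range(3):
        plus, minus = list(pos), list(pos)
        plus[i] += h
        minus[i] -= h
        result.append(-(potential(m, plus) - potential(m, minus)) / (2.0 * h))
    return tuple(result)


if __name__ == "__main__":
    m = 5.98e24                      # hypothetical source mass, kg
    pos = (6.4e6, 1.0e6, -2.0e6)     # hypothetical field point, m
    print("Eq. (6.7)     :", intensity(m, pos))
    print("-grad(phi)    :", minus_gradient(m, pos))
```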

Let us find an expression for the mutual potential energy of
a homogeneous spherical layer and a point particle of mass $m$.
We shall consider two cases corresponding to the particle being
outside and inside the layer, and shall begin with the former one
(Fig. 6.4a). Let us separate from the layer a ring whose edges
correspond to the values of the angle $\theta$ and $\theta + d\theta$. The radius of this
ring is $a \sin\theta$, and its width is $a\,d\theta$ (here $a$ is the radius of the layer).
Hence, the area of the ring is determined by the expression
$2\pi a^2 \sin\theta\,d\theta$. If the thickness of the layer is $da$ and its density is $\rho$,
then the mass of the ring is $2\pi\rho a^2\,da\,\sin\theta\,d\theta$. All the points of the
ring are at the same distance $r'$ from $m$. Consequently, by Eq. (6.9),
the mutual potential energy of the ring and the mass $m$ is determined
by the expression

$dE_p = -G \dfrac{2\pi\rho a^2\,da\,\sin\theta\,d\theta \cdot m}{r'}$ (6.13)

Fig. 6.4 (a), (b)


To obtain the potential energy of the entire spherical layer and
the mass $m$, we must integrate Eq. (6.13) with respect to the angle $\theta$
within the limits from 0 to $\pi$. Here the variable $r'$ varies within
the limits from $r - a$ to $r + a$, where $r$ is the distance from the
centre of the layer O to $m$. Equation (6.13) contains two related
variables, namely, $\theta$ and $r'$. We must exclude one of these variables
prior to integration. The latter is simplified if we exclude the
variable $\theta$. We can obtain the relation between $\theta$ and $r'$ by using the
theorem of cosines. Inspection of Fig. 6.4 shows that

$r'^2 = a^2 + r^2 - 2ar\cos\theta$

Differentiation of this expression yields

$2r'\,dr' = 2ar\sin\theta\,d\theta$

Hence, $\sin\theta\,d\theta = (r'/ar)\,dr'$. Making such a substitution in Eq. (6.13),
we get

$dE_p = -G \dfrac{2\pi\rho a\,da \cdot m}{r}\,dr'$

Integration with respect to $r'$ within the limits from $r'_1 = r - a$
to $r'_2 = r + a$ yields

$dE_{p,lay} = -G \dfrac{2\pi\rho a\,da \cdot m}{r} \int_{r-a}^{r+a} dr' = -G \dfrac{\rho\,4\pi a^2\,da \cdot m}{r}$ (6.14)

The expression $4\pi a^2\,da$ gives the volume of the layer, and $\rho\,4\pi a^2\,da$
its mass $dM$. Thus, the mutual potential energy of the spherical layer
and the mass $m$ is

$dE_{p,lay} = -G \dfrac{dM\,m}{r}$ (6.15)

where $r$ is the distance from the centre of the layer to $m$.

All our calculations remain the same for the case when the mass
is inside the layer (see Fig. 6.4b). Only the integration limits in
Eq. (6.14) will differ because $r'$ changes in this case from
$r' = a - r$ to $r' = a + r$. Consequently,

$dE_{p,lay} = -G \dfrac{2\pi\rho a\,da \cdot m}{r} \int_{a-r}^{a+r} dr' = -G\rho\,4\pi a\,da \cdot m = -G \dfrac{dM\,m}{a}$ (6.16)

Hence, in this case the potential energy is the same for all r’s and
equals the value obtained in Eq. (6.15) for $r = a$.

Equation (6.15) can be interpreted as the potential energy of the
particle $m$ in the field set up by a spherical layer of mass $dM$. The
derivative of this energy with respect to $r$ taken with the opposite
sign equals the projection onto the direction $r$ of the force acting
on the particle:

$dF_r = -\dfrac{\partial\,(dE_{p,lay})}{\partial r} = -G \dfrac{dM\,m}{r^2}$ (6.17)

The minus sign indicates that the force is directed toward the
diminishing of $r$, i.e. to the centre of the layer.

It follows from Eq. (6.17) that the sphere layer acts on the par- 
ticle with the same force that would be exerted on it by a point 
particle of a mass equal to that of the layer and placed at the centre 
of the latter. 

Equation (6.16) does not depend on the coordinates of a particle. 
Therefore, the gradient of this function vanishes for all r’s less 
than a. Thus, no force acts on a particle inside the layer. Every 
element of the layer naturally exerts a certain force on the particle, 
but the sum of the forces exerted by all the elements of the layer 
equals zero. 
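The two results for the spherical layer can be checked by integrating Eq. (6.13) over the angle numerically instead of changing the variable. This is a minimal sketch under stated assumptions: the midpoint rule, the function name and the test values are choices made here for illustration. The ring mass $2\pi\rho a^2\,da\,\sin\theta\,d\theta$ is written as $(dM/2)\sin\theta\,d\theta$, which follows from $dM = \rho\,4\pi a^2\,da$.

```python
import math

G = 6.670e-11  # gravitational constant quoted in the text, N*m^2/kg^2


def layer_potential_energy(dM, m, a, r, n=2000):
    """Mutual potential energy of a thin spherical layer (mass dM, radius a)
    and a particle m at distance r from the centre, obtained by summing
    Eq. (6.13) over rings with the midpoint rule (n rings)."""
    dtheta = math.pi / n
    total = 0.0
    for k in range(n):
        theta = (k + 0.5) * dtheta
        # Distance from the ring to the particle (theorem of cosines).
        r_prime = math.sqrt(a * a + r * r - 2.0 * a * r * math.cos(theta))
        ring_mass = 0.5 * dM * math.sin(theta) * dtheta
        total += -G * ring_mass * m / r_prime
    return total


if __name__ == "__main__":
    dM, m, a = 2.0, 3.0, 1.0  # hypothetical test values
    print("outside, r = 3a  :", layer_potential_energy(dM, m, a, 3.0),
          " Eq. (6.15):", -G * dM * m / 3.0)
    print("inside,  r = 0.4a:", layer_potential_energy(dM, m, a, 0.4),
          " Eq. (6.16):", -G * dM * m / a)
    print("inside,  r = 0.8a:", layer_potential_energy(dM, m, a, 0.8))
```

For points inside the layer the computed energy does not change with $r$, which is why its gradient, and hence the force, vanishes there.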

Now let us consider a system consisting of a homogeneous sphere
of mass $M$ and a point particle of mass $m$. Let us divide the sphere
into layers of mass $dM$. Each layer acts on the particle with a force
determined by Eq. (6.17). Summation of this expression over all
the layers gives the force exerted on the particle by the sphere:

$F_r = \int dF_r = -\int G \dfrac{dM\,m}{r^2} = -G \dfrac{Mm}{r^2}$ (6.18)

The action of the sphere on the particle is equivalent to the action 
of a point particle of a mass equal to that of the sphere and placed 
at its centre (see the preceding section). 

If we take a sphere with a spherical space inside, then no force 
will act on a particle in this space. 

Summation of Eq. (6.15) over all the layers of a solid or a hollow 
sphere yields the mutual potential energy of a particle and a sphere: 

$E_p = -G \dfrac{Mm}{r}$ (6.19)

Here M = mass of the sphere 
m = mass of the particle 

r = distance from the particle to the centre of the sphere. 

It follows from Eqs. (6.18) and (6.19) that the gravitational 
field produced by a homogeneous sphere is equivalent (outside the 
sphere) to the field produced by a point particle of the same mass 
at the centre of the sphere. 

Let us consider two homogeneous spheres of masses $M_1$ and $M_2$.
The second sphere experiences the same action on the part of the
first one as would be exerted by a point particle of mass $M_1$ at the
centre of the first sphere. According to Newton’s third law, the
corresponding force is equal in magnitude to the force that the
second sphere would exert on the particle $M_1$. By Eq. (6.18), the
magnitude of this force is $GM_1 M_2/r^2$. We have thus proved that
homogeneous spheres interact like point particles at their centres.


6.3. The Equivalence Principle 

Mass comes up in two different laws — in Newton’s second law 
and in the law of universal gravitation. In the former case, it char- 
acterizes the inertial properties of bodies, and in the latter their 
gravitational properties, i.e. the ability of bodies to attract one 
another. In this connection, the question arises whether we ought 
to distinguish the inertial mass $m_{in}$ and the gravitational mass $m_g$.

This question can be answered only by experiments. Let us
consider the free falling of bodies in a heliocentric reference frame.
Any body near the Earth’s surface experiences a force of attraction
to the Earth that by Eq. (6.18) is

$F = G \dfrac{m_g M_E}{R_E^2}$

where $m_g$ = gravitational mass of a given body
$M_E$ = gravitational mass of the Earth
$R_E$ = radius of the Earth.

This force causes the body to acquire the acceleration $a$ (but not $g$,
see Sec. 4.2) that must equal the force $F$ divided by the inertial
mass of the body $m_{in}$:

$a = \dfrac{F}{m_{in}} = G \dfrac{M_E}{R_E^2}\,\dfrac{m_g}{m_{in}}$ (6.20)


Experiments show that the acceleration $a$ is the same for all
bodies (it was shown in Sec. 4.2 that the identical values of $a$ follow
from the identical values of $g$). The factor $G(M_E/R_E^2)$ is also the
same for all bodies. Consequently, the ratio $m_g/m_{in}$ is the same for
all bodies too. All other experiments in which the difference between
the inertial and the gravitational masses could manifest itself lead
to a similar result.

We shall describe the experiment of R. Eotvos, which he began
in 1887 and continued over 25 years, as an example of such
experiments. Eotvos proceeded from the circumstance that a body at rest
near the Earth’s surface, apart from the reaction of its support,
experiences the gravitational force $F_g$ directed toward the Earth’s
centre and also the centrifugal force of inertia $F_{cf}$ directed
perpendicularly to the Earth’s axis of rotation (Fig. 6.5; this figure is not
drawn to scale, since the magnitude of the centrifugal force is two orders
smaller than that of the gravitational force, see Sec. 4.2). The
gravitational force is proportional to the gravitational mass of a body
$m_g$:

$F_g = m_g g'$

($g'$ is the gravitational intensity). The centrifugal force of inertia
is proportional to the inertial mass $m_{in}$. According to Eq. (4.7),
its magnitude is determined by the expression

$F_{cf} = m_{in}\,\omega^2 R_E \cos\varphi$

where $\varphi$ is the latitude of the locality. It follows from Fig. 6.5 that
the magnitude of the vertical component of the centrifugal force
of inertia is

$F_{vert} = F_{cf}\cos\varphi = m_{in}\,\omega^2 R_E \cos^2\varphi = A m_{in}$

We have introduced the symbol $A = \omega^2 R_E \cos^2\varphi$. Eotvos ran his
experiment at the latitude of $\varphi \approx 45$ degrees. In this case the
coefficient $A$ is about one-hundredth of $g'$.

The magnitude of the horizontal component of the force $F_{cf}$ is

$F_{hor} = F_{cf}\sin\varphi = m_{in}\,\omega^2 R_E \cos\varphi \sin\varphi = B m_{in}$

where $B = \omega^2 R_E \cos\varphi \sin\varphi$ (for $\varphi = 45°$, the values of the
coefficients $A$ and $B$ coincide).

Eotvos suspended a rod with bodies fastened to its ends on an 
elastic thread (Fig. 6.6). The bodies were of different materials, 
but their masses were as equal as possible. A mirror was attached 
to the bottom part of the thread. The beam from the light source 
reflected from the mirror struck the cross hairs of a telescope. The 
arms $l'$ and $l''$ were selected so that the rod was in equilibrium in the
vertical plane. The condition for this equilibrium is as follows:

$(m'_g g' - m'_{in} A)\,l' = (m''_g g' - m''_{in} A)\,l''$ (6.21)

The instrument was arranged with the rod perpendicular to the
plane of the meridian (see Fig. 6.6). In this case, the horizontal
components of the centrifugal force of inertia set up a twisting
moment equal to

$M_t = m'_{in} B l' - m''_{in} B l''$ (6.22)


Eliminating the arm $l''$ from Eqs. (6.21) and (6.22), we can arrive
at the following equation after simple transformations:

$M_t = m'_{in} B l' \left[ 1 - \dfrac{(m'_g/m'_{in})\,g' - A}{(m''_g/m''_{in})\,g' - A} \right]$


It can be seen from this equation that when the ratio of the
gravitational and inertial masses is the same for both bodies, the moment
twisting the thread must vanish. If the ratio $m_g/m_{in}$ for the first
and second bodies is not the same, the twisting moment differs from
zero. In this case, when the entire instrument is turned through
180 degrees, the twisting moment would reverse its sign and the
light spot would move from the cross hairs of the telescope (Fig. 6.7).
Eotvos compared eight different bodies (including a wooden one)
with a platinum body taken as the standard and discovered no
twisting of the thread. This gave him the grounds to state that the
ratio $m_g/m_{in}$ for these bodies is identical with an accuracy of $10^{-8}$.
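The behaviour of the twisting moment can be illustrated with a few numbers. The sketch below is not a model of the real instrument: the Earth's angular velocity and radius, the value of $g'$, and the mass and arm of the suspended body are assumed round values. It evaluates the coefficients $A$ and $B$ and the expression $M_t = m'_{in} B l' \left[1 - \dfrac{(m'_g/m'_{in})g' - A}{(m''_g/m''_{in})g' - A}\right]$ that follows from Eqs. (6.21) and (6.22).

```python
import math

OMEGA_EARTH = 7.29e-5  # angular velocity of the Earth's rotation, rad/s (assumed)
R_EARTH = 6.37e6       # radius of the Earth, m (assumed)
G_PRIME = 9.8          # gravitational intensity near the surface, m/s^2 (assumed)


def coefficients(latitude_deg):
    """Coefficients A and B of the vertical and horizontal components
    of the centrifugal force of inertia per unit inertial mass."""
    phi = math.radians(latitude_deg)
    a_coef = OMEGA_EARTH ** 2 * R_EARTH * math.cos(phi) ** 2
    b_coef = OMEGA_EARTH ** 2 * R_EARTH * math.cos(phi) * math.sin(phi)
    return a_coef, b_coef


def twisting_moment(ratio_1, ratio_2, m_in_1, arm_1, latitude_deg=45.0):
    """Twisting moment for bodies with gravitational-to-inertial mass
    ratios ratio_1 and ratio_2; m_in_1 and arm_1 refer to the first body."""
    a_coef, b_coef = coefficients(latitude_deg)
    bracket = 1.0 - (ratio_1 * G_PRIME - a_coef) / (ratio_2 * G_PRIME - a_coef)
    return m_in_1 * b_coef * arm_1 * bracket


if __name__ == "__main__":
    # Hypothetical body: 30 g on a 20 cm arm.
    print("equal ratios          :", twisting_moment(1.0, 1.0, 0.03, 0.2))
    print("ratios differ by 1e-8 :", twisting_moment(1.0, 1.0 + 1e-8, 0.03, 0.2))
```

With equal ratios the bracket is exactly zero, while a relative difference of $10^{-8}$ leaves a small but nonzero moment, which is the quantity the torsion thread has to resolve.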




In 1961-1964, R. Dicke improved Eotvos’s method. He used the 
Sun’s gravitational field and the centrifugal force of inertia due 
to the Earth’s orbital motion for producing the twisting moment. 
As a result of his measurements, he arrived at the conclusion that 
the ratio $m_g/m_{in}$ is the same for the studied bodies with an accuracy
of $10^{-11}$. Finally, in 1971, V. Braginsky and V. Panov obtained
the constancy of the ratio with an accuracy up to $10^{-12}$.

Thus, all the experimental facts indicate that the inertial and
gravitational masses of all bodies are strictly proportional to each
other. This signifies that these masses become identical when the
units are selected properly. This is why physicists simply speak of
mass. Albert Einstein based his general theory of relativity on the
gravitational and inertial masses being identical.

We have already noted in Sec. 4.1 that the forces of inertia are 
similar to gravitational forces — both are proportional to the mass 
of the body which they are acting on. We have indicated there that 
if we are in a closed cab, no experiments will help us to establish 
what the action of the force $mg$ is due to: whether it is due to the
cab moving with the acceleration $-g$, or to the fact that the
stationary cab is near the Earth’s surface. This statement forms the
content of the so-called equivalence principle. 

The identical nature of inertial and gravitational masses is the 
result of the equivalence of forces of inertia and gravitational forces. 





It must be noted that from the very beginning we assumed in
Eq. (6.1) that the mass coincides with the inertial mass of bodies,
and we therefore determined the numerical value of G assuming
that $m_g = m_{in}$. We can thus write Eq. (6.20) in the form

$a = G \dfrac{M_E}{R_E^2}$ (6.23)

Equation (6.23) permits us to determine the mass of the Earth $M_E$.
Use of the measured values of $a$, $R_E$ and $G$ in it gives the value
of $5.98 \times 10^{24}$ kg for the Earth’s mass.

Further, knowing the radius of the Earth’s orbit $R_{orb}$ and the
time $T$ of one complete revolution of the Earth about the Sun,
we can find the Sun’s mass $M_S$. The Earth’s acceleration equal
to $\omega^2 R_{orb}$ (the angular velocity $\omega = 2\pi/T$) is due to the force with
which the Sun attracts the Earth. Hence,

$M_E\,\omega^2 R_{orb} = G \dfrac{M_E M_S}{R_{orb}^2}$

whence we can calculate the Sun’s mass.

The masses of other celestial bodies were determined in a similar 
way. 
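Both mass determinations reduce to a line of arithmetic. The following sketch assumes commonly quoted round values for the acceleration $a$, the Earth's radius, the radius of the Earth's orbit and the length of the year; none of these inputs are given in the text, so the results should be read as order-of-magnitude checks rather than as the author's calculation.

```python
import math

G = 6.670e-11  # gravitational constant quoted in the text, N*m^2/kg^2


def earth_mass(a, r_earth):
    """Mass of the Earth from a = G M_E / R_E^2, Eq. (6.23)."""
    return a * r_earth ** 2 / G


def sun_mass(r_orb, period):
    """Mass of the Sun from omega^2 R_orb = G M_S / R_orb^2, omega = 2*pi/T."""
    omega = 2.0 * math.pi / period
    return omega ** 2 * r_orb ** 3 / G


if __name__ == "__main__":
    # Assumed inputs: a = 9.83 m/s^2, R_E = 6.37e6 m,
    # R_orb = 1.496e11 m, T = 3.156e7 s (one year).
    print("Earth's mass, kg:", earth_mass(9.83, 6.37e6))
    print("Sun's mass, kg  :", sun_mass(1.496e11, 3.156e7))
```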


6.4. Orbital and Escape Velocities 

To travel about the Earth in a circular orbit with a radius differing
only slightly from the Earth’s radius $R_E$, a body must have a
definite velocity $v_1$. Its value can be found from the condition of
equality of the product of the mass of the body and its acceleration to
the force of gravity acting on the body:

$m \dfrac{v_1^2}{R_E} = mg$

Hence,

$v_1 = \sqrt{gR_E}$ (6.24)

Consequently, for a body to become a satellite of the Earth, it
must be given the velocity $v_1$ called the tangential or orbital
velocity ($v_1$ is also sometimes called the first cosmic velocity).
Introduction of the values of $g$ and $R_E$ gives the following value for the
orbital velocity:

$v_1 = \sqrt{gR_E} = \sqrt{9.8 \times 6.4 \times 10^6} \approx 8 \times 10^3$ m/s = 8 km/s

A body having the velocity $v_1$ will not fall onto the Earth. This
velocity, however, is not sufficient for the body to leave the sphere
of the Earth’s attraction, i.e. to travel away from the Earth over
a distance such that its attraction to the Earth stops playing a
significant part. The velocity $v_2$ required for this purpose is called
the escape velocity (the second cosmic velocity).

To find the escape velocity, we must calculate the work that must
be done against the forces of the Earth’s attraction for moving
a body from the Earth’s surface to infinity. When a body moves
away, the forces of the Earth’s attraction do the following work on it:

$A' = E_{p,init} - E_{p,fin}$

According to Eq. (6.19), the initial potential energy is

$E_{p,init} = -G \dfrac{M_E m}{R_E}$

and the final potential energy is zero. Thus,

$A' = -G \dfrac{M_E m}{R_E}$

The work $A$ that must be done against the forces of the Earth’s
attraction equals the work $A'$ taken with the opposite sign, i.e.

$A = G \dfrac{M_E m}{R_E}$ (6.25)


Disregarding the difference between the force of gravity $mg$ and
the force of gravitational attraction of a body to the Earth, we
can write that

$mg = G \dfrac{M_E m}{R_E^2}$

Hence,

$G \dfrac{M_E}{R_E} = gR_E$

Consequently, the work (6.25) can be written in the form

$A = mgR_E$ (6.26)


A body leaving the Earth does this work at the expense of its
store of kinetic energy. For this store of energy to be sufficient for
doing the work (6.26), the body must be projected from the Earth’s
surface with a velocity $v$ not lower than the value $v_2$ determined by
the condition

$\dfrac{m v_2^2}{2} = mgR_E$

whence

$v_2 = \sqrt{2gR_E}$ (6.27)

It is exactly the velocity $v_2$ that is the escape velocity from the
Earth, or the second cosmic velocity. A comparison of Eqs. (6.27)
and (6.24) shows that this velocity is $\sqrt{2}$ times greater than the
orbital one. Multiplying 8 km/s by $\sqrt{2}$, we get the approximate
value of 11 km/s for $v_2$.
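The two velocities are easy to evaluate directly from Eqs. (6.24) and (6.27). A minimal sketch with the same rounded inputs as the numerical estimate in the text ($g = 9.8$ m/s$^2$, $R_E = 6.4 \times 10^6$ m); the function names are illustrative.

```python
import math

G_ACC = 9.8      # acceleration of free fall, m/s^2 (value used in the text)
R_EARTH = 6.4e6  # radius of the Earth, m (value used in the text)


def orbital_velocity(g=G_ACC, r=R_EARTH):
    """First cosmic velocity, Eq. (6.24): v1 = sqrt(g R_E)."""
    return math.sqrt(g * r)


def escape_velocity(g=G_ACC, r=R_EARTH):
    """Second cosmic velocity, Eq. (6.27): v2 = sqrt(2 g R_E)."""
    return math.sqrt(2.0 * g * r)


if __name__ == "__main__":
    v1, v2 = orbital_velocity(), escape_velocity()
    print(f"v1 = {v1 / 1e3:.2f} km/s")
    print(f"v2 = {v2 / 1e3:.2f} km/s")
    print(f"v2 / v1 = {v2 / v1:.4f}")
```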

It must be noted that the required magnitude of the velocity 
does not depend on the direction in which a body is launched from 
the Earth. This direction only affects the shape of the trajectory 
along which the body travels away from the Earth. 

To leave the solar system, a body must overcome the forces of 
attraction to the Sun in addition to the Earth’s attraction. The 
velocity of launching a body from the Earth’s surface needed for 
this purpose is called the escape velocity from the solar system, 
the space velocity, or the third cosmic velocity $v_3$. The velocity $v_3$
depends on the direction of launching. When a body is launched
in the direction of orbital motion of the Earth, this velocity is
minimum and is about 17 km/s (in this case the body’s velocity
relative to the Sun is the sum of its velocity relative to the Earth
and the velocity with which the Earth is travelling about the Sun).
When a body is launched in a direction opposite to that of the Earth’s
orbital motion, $v_3 \approx 73$ km/s.
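The quoted values of $v_3$ can be reproduced with a rough estimate. The sketch below rests on assumptions that are not spelled out in the text: the Earth's orbital speed is taken as 29.8 km/s; the velocity of escape from the Sun at the Earth's orbit is taken as $\sqrt{2}$ times that speed, by analogy with Eqs. (6.24) and (6.27); and the launch velocity is found from energy conservation, $v_3^2 = v_\infty^2 + v_2^2$, where $v_\infty$ is the velocity relative to the Earth that the body must retain after leaving the sphere of the Earth's attraction.

```python
import math

V_EARTH_ORBIT = 29.8e3  # orbital speed of the Earth about the Sun, m/s (assumed)
V2 = 11.2e3             # escape velocity from the Earth, m/s (from Eq. 6.27)


def third_cosmic_velocity(along_orbit=True):
    """Estimate of the launch velocity needed to leave the solar system.

    along_orbit=True  : launch in the direction of the Earth's orbital motion
    along_orbit=False : launch in the opposite direction
    """
    v_escape_sun = math.sqrt(2.0) * V_EARTH_ORBIT
    if along_orbit:
        v_inf = v_escape_sun - V_EARTH_ORBIT  # the Earth's motion helps
    else:
        v_inf = v_escape_sun + V_EARTH_ORBIT  # the Earth's motion must be cancelled
    # Energy balance while leaving the Earth's sphere of attraction.
    return math.sqrt(v_inf ** 2 + V2 ** 2)


if __name__ == "__main__":
    print(f"along the orbital motion  : {third_cosmic_velocity(True) / 1e3:.1f} km/s")
    print(f"against the orbital motion: {third_cosmic_velocity(False) / 1e3:.1f} km/s")
```

Under these assumptions the estimate gives values close to the 17 km/s and 73 km/s quoted above.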

The orbital and escape velocities were reached for the first time 
in the USSR. On October 4, 1957, the first successful launching of 
an artificial satellite of the Earth in the history of mankind was 
carried out in the Soviet Union. A second advance occurred on 
January 2, 1959. This day saw the launching from Soviet soil of 
a spaceship that escaped from the sphere of the Earth’s attraction 
and became the first artificial planet of our solar system. On April 12, 
1961, the first flight of a man into outer space was accomplished 
in the Soviet Union. The first Soviet cosmonaut Yuri Gagarin 
completed a flight around the Earth and landed successfully. 



CHAPTER 7 OSCILLATORY MOTION

7.1. General 

Oscillations are defined as processes distinguished by a certain 
degree of repetition. For example, the swings of a clock pendulum, 
the vibrations of a string or the leg of a tuning fork, and the voltage 
across the plates of a capacitor in a radio receiver circuit have this 
property of repetition. 

Depending on the physical nature of the repeating process, we 
distinguish mechanical, electromagnetic, sound, and other oscilla- 
tions. In the present chapter, we shall deal with mechanical oscil- 
lations. 

Oscillations (vibrations) are widespread in nature and engineering. 
They often have a negative influence. The oscillations of a railway 
bridge due to the impacts imparted to it by the wheels of a train 
passing over the rail joints, the vibrations of a ship’s hull caused 
by rotation of the propeller, the vibrations of the wings of an aircraft 
are all processes that may have catastrophic consequences. The task 
in such cases is to prevent the setting up of oscillations or at any 
rate to prevent them from reaching dangerous magnitudes. 

Oscillatory processes are also at the very foundation of various 
branches of engineering. For instance, radio engineering owes its 
very existence to oscillatory processes. 

Depending on the nature of the action on an oscillating system, 
we distinguish free (or natural) oscillations, forced oscillations, 
auto-oscillations, and parametric oscillations. 

Free or natural oscillations occur in a system left alone after
an impetus has been imparted to it or it has been brought out of its
equilibrium position. An example is the oscillation of a ball suspended
on a string (a pendulum). To initiate oscillations, we may either
push the ball or move it to a side and release it.

In forced oscillations, the oscillating system is acted upon by an
external periodically changing force. An example is the oscillation
of a bridge set up when people walking in step pass over it.

Auto-oscillations, like forced ones, are attended by the action 
of external forces on the oscillating system, but the moments of 
time when these actions are exerted are set by the oscillating system 
itself — the latter controls the external action. Examples of an auto- 
oscillating system are clocks in which a pendulum receives pushes 
at the expense of the energy of a lifted weight or a coiled spring,
and these pushes occur when the pendulum passes through its middle
position.

In parametric oscillations, external action causes periodic changes 
in a parameter of a system, for instance, in the length of a thread 
on which an oscillating ball is suspended. 

Harmonic oscillations are the simplest ones. These are oscillations 
when the oscillating quantity (for example, the deflection of a pen- 
dulum) changes with time according to a sine or cosine law. This 
kind of oscillations is especially important for the following reasons: 
first, oscillations in nature and engineering are often close to har- 
monic ones in their character, and, second, periodic processes of 
a different form (with a different time dependence) can be represent- 
ed as the superposition of several harmonic oscillations. 


7.2. Small-Amplitude Oscillations 

Let us consider a mechanical system whose position can be set 
with the aid of a single quantity which we shall designate x. The 
system is said to have one degree of freedom in such cases. The angle 
measured from a certain plane, or the distance measured along a giv- 
en curve, in particular a straight line, etc. may be the quantity x 
determining the position of the system. The potential energy of the
system will be a function of the single variable x: $E_p = E_p(x)$.
Assume that the system has a position of stable equilibrium. In
this position, the function $E_p(x)$ has a minimum (see Sec. 3.9).
We shall measure the coordinate x and the potential energy $E_p$
from the position of equilibrium. Hence $E_p(0) = 0$.

Let us expand the function $E_p(x)$ in a power series and consider
only small-amplitude oscillations, so that the higher powers of x
may be disregarded. According to the Maclaurin theorem

$$E_p(x) = E_p(0) + E_p'(0)\,x + \tfrac{1}{2} E_p''(0)\,x^2$$

(owing to the small value of x we disregard the remaining terms).
Since $E_p(x)$ has a minimum at x = 0, $E_p'(0)$ equals zero, and
$E_p''(0)$ is positive. In addition, according to our condition,
$E_p(0) = 0$. Let us introduce the symbol $E_p''(0) = k$ (where
k > 0). Hence,

$$E_p(x) = \tfrac{1}{2} k x^2 \tag{7.1}$$
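The quality of the quadratic approximation (7.1) can be checked numerically. The following is only a sketch: the potential $1 - \cos x$, the function names, and the finite-difference step h are assumptions chosen for illustration and do not come from the text.

```python
import math

def quadratic_approximation(potential, x, h=1e-4):
    """Estimate k = E_p''(0) by a central difference and return (k, k*x**2/2)."""
    k = (potential(h) - 2.0 * potential(0.0) + potential(-h)) / h**2
    return k, 0.5 * k * x**2

def example_potential(x):
    # Hypothetical potential with a minimum at x = 0 and E_p(0) = 0.
    return 1.0 - math.cos(x)

k, approx = quadratic_approximation(example_potential, 0.1)
exact = example_potential(0.1)
relative_error = abs(approx - exact) / exact
print(k, approx, exact, relative_error)
```

For this sample potential the relative error of Eq. (7.1) at x = 0.1 is below one part in a thousand, and it grows as the displacement increases.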

Equation (7.1) is identical with Eq. (3.78) for the potential energy
of a deformed spring. Using Eq. (3.32), let us find the force acting
on the system:

$$F_x = -\frac{\partial E_p}{\partial x} = -kx \tag{7.2}$$

This equation gives the projection of the force onto the direction x.
In the following, we shall omit the subscript x in designating the
force, i.e. we shall write Eq. (7.2) in the form $F = -kx$.

Equation (7.2) is identical with Eq. (2.26) for the elastic force
of a deformed spring. This is why forces of the kind shown by Eq. (7.2),
regardless of their nature, are called quasi-elastic. It is easy to see
that a force described by Eq. (7.2) is always directed toward the
position of equilibrium. The magnitude of the force is proportional
to the deviation of the system from its equilibrium position. A force
having such properties is sometimes defined as a restoring force.

Let us consider as an example a system consisting of a ball of
mass m suspended on a spring whose mass may be ignored in
comparison with m (Fig. 7.1). In the equilibrium position, the
force mg is balanced by the elastic force $k\,\Delta l_0$:

$$mg = k\,\Delta l_0 \tag{7.3}$$

($\Delta l_0$ is the elongation of the spring). We shall characterize
the displacement of the ball from its equilibrium position by the
coordinate x with the x-axis directed vertically downward and the
zero of the axis coinciding with the position of equilibrium of the
ball. If we shift the ball to the position characterized by the
coordinate x, then the elongation of the spring will become equal to
$\Delta l_0 + x$, and the projection of the resultant force onto the
x-axis will acquire the value $F = mg - k(\Delta l_0 + x)$. Taking
into account condition (7.3), we find that

$$F = -kx \tag{7.4}$$

Fig. 7.1

Thus, in the example considered, the resultant of the force of grav- 
ity and of the elastic force has the nature of a quasi-elastic force. 

Let us impart the displacement x = a to the ball and then leave
the system alone. Under the action of the quasi-elastic force, the
ball will move toward its equilibrium position with the constantly
growing velocity $v = \dot{x}$. The potential energy of the system will
diminish (Fig. 7.2), but a constantly growing kinetic energy
$E_k = \tfrac{1}{2} m \dot{x}^2$ will appear instead (we disregard the
mass of the spring). Arriving at its equilibrium position, the ball
continues to move by inertia. This motion will be retarded and will
stop when the kinetic energy completely transforms into potential
energy, i.e. when the displacement of the ball becomes equal to $-a$.
Next the same process will repeat when the ball moves in the reverse
direction. If friction is absent in the system, its energy is
conserved, and the ball will move within the limits from x = a to
$x = -a$ for an infinitely long time.

The equation of Newton’s second law for the ball is

$$m\ddot{x} = -kx \tag{7.5}$$

Introducing the symbol

$$\omega_0^2 = \frac{k}{m} \tag{7.6}$$

we can transform Eq. (7.5) as follows:

$$\ddot{x} + \omega_0^2 x = 0 \tag{7.7}$$

Since k/m > 0, $\omega_0$ is a real quantity.

Thus, in the absence of forces of friction, motion under the action
of a quasi-elastic force is described by the differential
equation (7.7).

Any real oscillatory system contains resistance or damping forces
whose action leads to diminishing of the energy of the system. If the
depletion of the energy is not replenished as a result of the work of
external forces, the oscillations will be damped. In the simplest and
also the most frequently encountered case, the damping force $F^*$ is
proportional to the magnitude of the velocity:

$$F^* = -r\dot{x} \tag{7.8}$$

Here r is a constant called the resistance coefficient. The minus sign
is due to the circumstance that the force $F^*$ and the velocity v have
opposite directions; consequently, their projections onto the x-axis
have opposite signs.

The equation of Newton’s second law when damping forces are
present has the form

$$m\ddot{x} = -kx - r\dot{x} \tag{7.9}$$

Introducing the notation

$$2\beta = \frac{r}{m} \tag{7.10}$$

and using Eq. (7.6), we can rewrite Eq. (7.9) as follows:

$$\ddot{x} + 2\beta\dot{x} + \omega_0^2 x = 0 \tag{7.11}$$

This differential equation describes the damped oscillations of
a system.

The oscillations described by Eqs. (7.7) and (7.11) are free (or 
natural): a system brought out of its equilibrium position or having 
received an impetus performs oscillations when left alone. Now 
assume that an oscillatory system experiences an external force 
that changes with time according to a harmonic law: 

$$F_x = F_0 \cos \omega t \tag{7.12}$$

In this case, the equation of Newton’s second law has the form

$$m\ddot{x} = -kx - r\dot{x} + F_0 \cos \omega t$$

Using Eqs. (7.6) and (7.10), let us write this equation as follows:

$$\ddot{x} + 2\beta\dot{x} + \omega_0^2 x = f_0 \cos \omega t \tag{7.13}$$

where

$$f_0 = \frac{F_0}{m} \tag{7.14}$$

Equation (7.13) describes forced oscillations.

We have established that when studying various kinds of oscillations
we are confronted with the need to solve differential equations of
the form

$$\ddot{x} + a\dot{x} + bx = f(t) \tag{7.15}$$

where a and b are constants, and f(t) is a function of t. An equation
such as (7.15) is called a linear differential equation with constant
coefficients. For Eq. (7.7), we have a = 0 and $b = \omega_0^2$, and for
Eq. (7.11), we have $a = 2\beta$ and $b = \omega_0^2$. In both cases, the
function f(t) identically equals zero: $f(t) \equiv 0$. For forced
oscillations, $f(t) = f_0 \cos \omega t$.

The solution of Eq. (7.15) is greatly facilitated if we pass over
to complex quantities. This is why we shall stop to briefly treat
complex numbers and methods of solving linear differential equations
with constant coefficients.
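Before turning to the analytical treatment, it may help to see that Eq. (7.15) can also be integrated numerically. The sketch below is not part of the original text: it uses the classical fourth-order Runge-Kutta scheme, and the function name `integrate`, the step count, and the sample values of $\omega_0$ and $\beta$ are assumptions made for illustration.

```python
import math

def integrate(a, b, f, x0, v0, t_end, steps):
    """Integrate x'' + a*x' + b*x = f(t) from t = 0 with RK4; return x(t_end)."""
    def deriv(t, x, v):
        # First-order system: dx/dt = v, dv/dt = f(t) - a*v - b*x
        return v, f(t) - a * v - b * x

    dt = t_end / steps
    t, x, v = 0.0, x0, v0
    for _ in range(steps):
        k1x, k1v = deriv(t, x, v)
        k2x, k2v = deriv(t + dt / 2, x + dt / 2 * k1x, v + dt / 2 * k1v)
        k3x, k3v = deriv(t + dt / 2, x + dt / 2 * k2x, v + dt / 2 * k2v)
        k4x, k4v = deriv(t + dt, x + dt * k3x, v + dt * k3v)
        x += dt / 6 * (k1x + 2 * k2x + 2 * k3x + k4x)
        v += dt / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
        t += dt
    return x

omega0 = 2.0  # assumed sample value, so that 2*pi/omega0 equals pi

# Free undamped case, Eq. (7.7): a = 0, b = omega0**2, f = 0
x_free = integrate(0.0, omega0**2, lambda t: 0.0, 1.0, 0.0, math.pi, 2000)

# Free damped case, Eq. (7.11): a = 2*beta with an assumed beta = 0.1
x_damped = integrate(0.2, omega0**2, lambda t: 0.0, 1.0, 0.0, math.pi, 2000)
print(x_free, x_damped)
```

With these sample values the undamped system returns to its initial displacement after the time $2\pi/\omega_0$, while the damped system reaches a smaller displacement at the same moment.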


7.3. Complex Numbers

The complex number z is defined as a number of the kind

$$z = x + iy \tag{7.16}$$

where x and y are real numbers, and i is the imaginary unit
($i^2 = -1$). The number x is called the real part of the complex
number z. This is written symbolically in the form x = Re {z}. The
number y is the imaginary part of z (symbolically y = Im {z}). The
number

$$z^* = x - iy \tag{7.17}$$

is called the complex conjugate of the number x + iy.

The real number x can be depicted by a point on the x-axis. The
complex number z can be depicted by a point on a plane with the
coordinates x and y (Fig. 7.3). Each point of the plane corresponds
to a complex number z. Consequently, a complex number can be given in
the form of Eq. (7.16) with the aid of the Cartesian coordinates of
the relevant point. The same number, however, can be given with the
aid of the polar coordinates $\rho$ and $\varphi$. The following
relations exist between the two pairs of coordinates:

$$x = \rho \cos \varphi, \quad y = \rho \sin \varphi, \quad
\rho = \sqrt{x^2 + y^2}, \quad \varphi = \arctan \frac{y}{x} \tag{7.18}$$

The distance from the origin of coordinates to the point depicting
the number z is called the absolute value or modulus of the complex
number (its symbol is |z|). It is obvious that

$$|z| = \rho = \sqrt{x^2 + y^2} \tag{7.19}$$

The quantity $\varphi$ is called the argument of the complex number z.

With a view to Eqs. (7.18), we can write a complex number in the
trigonometric form:

$$z = \rho (\cos \varphi + i \sin \varphi) \tag{7.20}$$

Two complex numbers $z_1 = x_1 + iy_1$ and $z_2 = x_2 + iy_2$ are
considered to equal each other if their real and imaginary parts
are separately equal, namely,

$$z_1 = z_2 \quad \text{if} \quad x_1 = x_2 \ \text{and} \ y_1 = y_2 \tag{7.21}$$
The moduli of two equal complex numbers are identical, while
their arguments can differ only in an addend that is a multiple
of $2\pi$:

$$\rho_1 = \rho_2, \quad \varphi_1 = \varphi_2 \pm 2k\pi \tag{7.22}$$

where k is an integer.

Examination of Eqs. (7.16) and (7.17) shows that when $z^* = z$,
the imaginary part of z vanishes, i.e. the number z is a pure real
number. Thus, the condition for the number z being real can be
written in the form

$$z^* = z \tag{7.23}$$

The relation

$$e^{i\varphi} = \cos \varphi + i \sin \varphi \tag{7.24}$$

is proved in mathematics and is called the Euler formula. Substituting
$-\varphi$ for $\varphi$ in this equation and bearing in mind that
$\cos(-\varphi) = \cos \varphi$ and $\sin(-\varphi) = -\sin \varphi$, we get

$$e^{-i\varphi} = \cos \varphi - i \sin \varphi \tag{7.25}$$

Let us add Eqs. (7.24) and (7.25) and solve the resulting
equation relative to $\cos \varphi$. We obtain

$$\cos \varphi = \tfrac{1}{2} (e^{i\varphi} + e^{-i\varphi}) \tag{7.26}$$

Subtraction of Eq. (7.25) from (7.24) yields
$\sin \varphi = (e^{i\varphi} - e^{-i\varphi}) / 2i$.

Equation (7.24) can be used to write a complex number in the
exponential form:

$$z = \rho e^{i\varphi} \tag{7.27}$$

[see Eq. (7.20)]. The complex conjugate number in the exponential
form is

$$z^* = \rho e^{-i\varphi} \tag{7.28}$$

In the addition of complex numbers, their real and imaginary
parts are added separately:

$$z_1 + z_2 = (x_1 + x_2) + i (y_1 + y_2) \tag{7.29}$$

It is convenient to multiply complex numbers by taking them
in the exponential form:

$$z = z_1 z_2 = \rho_1 e^{i\varphi_1} \cdot \rho_2 e^{i\varphi_2} = \rho_1 \rho_2 e^{i(\varphi_1 + \varphi_2)} \tag{7.30}$$

The moduli of the complex numbers in this case are multiplied,
and the arguments are added:

$$\rho = \rho_1 \rho_2, \quad \varphi = \varphi_1 + \varphi_2 \tag{7.31}$$

Complex numbers are divided in a similar way:

$$z = \frac{z_1}{z_2} = \frac{\rho_1 e^{i\varphi_1}}{\rho_2 e^{i\varphi_2}} = \frac{\rho_1}{\rho_2} e^{i(\varphi_1 - \varphi_2)} \tag{7.32}$$
It is a simple matter to find from Eqs. (7.27) and (7.28) that

$$zz^* = \rho^2 \tag{7.33}$$

(the square of the modulus of a complex number equals the product 
of this number and its complex conjugate). 
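The relations of this section can be tried out with Python's standard `cmath` module. This is an illustrative sketch only; the sample numbers are arbitrary assumptions. Note that `cmath.polar` determines the argument with the quadrant taken into account, whereas the bare arctangent in Eq. (7.18) fixes it only up to an addend of $\pi$.

```python
import cmath
import math

z1 = complex(3.0, 4.0)               # z = x + iy, Eq. (7.16)
rho1, phi1 = cmath.polar(z1)         # modulus and argument, Eqs. (7.18), (7.19)

z2 = cmath.rect(2.0, math.pi / 6)    # rho * e^{i phi}, Eq. (7.27)

product = z1 * z2
rho, phi = cmath.polar(product)      # moduli multiply, arguments add, Eq. (7.31)

euler_lhs = cmath.exp(1j * phi1)     # Euler formula, Eq. (7.24)
euler_rhs = complex(math.cos(phi1), math.sin(phi1))

modulus_squared = z1 * z1.conjugate()  # Eq. (7.33)
print(rho1, phi1, rho, phi, modulus_squared)
```

Here the modulus of the product equals the product of the moduli, and its argument equals the sum of the arguments, as Eq. (7.31) states.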


7.4. Linear Differential Equations 

An equation of the kind

$$\ddot{x} + a\dot{x} + bx = f(t) \tag{7.34}$$

where a and b are constants, and f(t) is a given function of t,
is called a linear differential equation of the second order with
constant coefficients. The constants a and b may also be zero.

If the function f(t) is identically equal to zero [$f(t) \equiv 0$], the
equation is called homogeneous, otherwise it is called non-homogeneous.
A homogeneous equation has the form

$$\ddot{x} + a\dot{x} + bx = 0 \tag{7.35}$$

The solution of any second-order differential equation (i.e. with
a second derivative as the highest-order term) contains two arbitrary
constants $C_1$ and $C_2$. This can be understood in view of the
circumstance that a function is determined from its second derivative
by double integration. An integration constant appears upon each
integration. Let us consider as an example the equation

$$\ddot{x} = 0 \tag{7.36}$$

Integration of this equation yields $\dot{x} = C_1$. Repeated integration
results in the function

$$x = C_1 t + C_2 \tag{7.37}$$

It is easy to see that the function (7.37) satisfies Eq. (7.36) with any
values of the constants $C_1$ and $C_2$.

Assigning definite values to the constants $C_1$ and $C_2$, we get the
so-called partial solution of a differential equation. For example,
the function 5t + 3 is one of the partial solutions of Eq. (7.36).

The set of all the partial solutions without any exception
is called the general solution of a differential equation. The
general solution of Eq. (7.36) has the form of Eq. (7.37).

It is proved in the theory of linear differential equations that if
$x_1$ and $x_2$ are linearly independent* solutions of the homogeneous
equation (7.35), then the general solution of this equation can be
represented in the form

$$x = C_1 x_1 + C_2 x_2 \tag{7.38}$$

where $C_1$ and $C_2$ are arbitrary constants.

* The functions $x_1$ and $x_2$ are called linearly independent if the
relation $\alpha_1 x_1 + \alpha_2 x_2 \equiv 0$ is obeyed only when
$\alpha_1$ and $\alpha_2$ equal zero.

Assume that $x_n(t, C_1, C_2)$ is the general solution of the
non-homogeneous equation (7.34) (the arbitrary constants $C_1$ and $C_2$
are parameters in this solution), and $x_n(t)$ is one of the partial
solutions of the same equation (it contains no arbitrary constants).
We shall introduce the notation

$$x(t, C_1, C_2) = x_n(t, C_1, C_2) - x_n(t)$$

The general solution of the non-homogeneous equation can therefore
be written in the form

$$x_n(t, C_1, C_2) = x_n(t) + x(t, C_1, C_2) \tag{7.39}$$

The function (7.39) satisfies Eq. (7.34) at any values of the constants
$C_1$ and $C_2$. Consequently, we can write the relation

$$\ddot{x}_n(t) + \ddot{x}(t, C_1, C_2) + a\dot{x}_n(t) + a\dot{x}(t, C_1, C_2)
+ b x_n(t) + b x(t, C_1, C_2) = f(t)$$

Grouping of the addends yields

$$\ddot{x}(t, C_1, C_2) + a\dot{x}(t, C_1, C_2) + b x(t, C_1, C_2)
+ [\ddot{x}_n(t) + a\dot{x}_n(t) + b x_n(t)] = f(t) \tag{7.40}$$

The partial solution $x_n(t)$ also satisfies Eq. (7.34). Consequently,
the expression in brackets in the left-hand side of Eq. (7.40) equals
the right-hand side of this equation. It thus follows that the function
$x(t, C_1, C_2)$ must satisfy the condition

$$\ddot{x}(t, C_1, C_2) + a\dot{x}(t, C_1, C_2) + b x(t, C_1, C_2) = 0$$

i.e. it is the general solution of the homogeneous equation (7.35).
We have therefore arrived at a very useful theorem: the general
solution of a non-homogeneous equation equals the sum of the general
solution of the corresponding homogeneous equation and a partial
solution of the non-homogeneous equation:

$$x_{\text{gen, non-hom}} = x_{\text{gen, hom}} + x_{\text{part, non-hom}} \tag{7.41}$$

Linear homogeneous differential equations with constant
coefficients are solved using the substitution

$$x(t) = e^{\lambda t} \tag{7.42}$$

where $\lambda$ is a constant quantity. Differentiation of the function
(7.42) yields

$$\dot{x}(t) = \lambda e^{\lambda t}, \quad \ddot{x}(t) = \lambda^2 e^{\lambda t} \tag{7.43}$$

The introduction of Eqs. (7.42) and (7.43) into (7.35) results in the
following equation, after the factor $e^{\lambda t}$ differing from zero
is cancelled out:

$$\lambda^2 + a\lambda + b = 0 \tag{7.44}$$

This equation is called a characteristic one. Its roots are the values
of $\lambda$ at which the function (7.42) satisfies Eq. (7.35).

If the roots of Eq. (7.44) do not coincide ($\lambda_1 \neq \lambda_2$),
the functions $\exp(\lambda_1 t)$ and $\exp(\lambda_2 t)$ will be linearly
independent. Consequently, according to Eq. (7.38), the general
solution of Eq. (7.35) can be written as follows:

$$x = C_1 e^{\lambda_1 t} + C_2 e^{\lambda_2 t} \tag{7.45}$$

It can be shown that when $\lambda_1 = \lambda_2 = \lambda$ the general
solution of Eq. (7.35) is as follows:

$$x = C_1 e^{\lambda t} + C_2 t e^{\lambda t} \tag{7.46}$$
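The recipe of Eqs. (7.44)-(7.46) can be sketched in a few lines. The function names below are hypothetical, and the exact comparison of the two roots is a simplification that is adequate only for coefficients whose discriminant vanishes exactly in floating-point arithmetic.

```python
import cmath

def characteristic_roots(a, b):
    """Roots of the characteristic equation lambda**2 + a*lambda + b = 0 (7.44)."""
    disc = cmath.sqrt(a * a - 4.0 * b)
    return (-a + disc) / 2.0, (-a - disc) / 2.0

def general_solution(a, b, c1, c2):
    """Return x(t) built from Eq. (7.45) or, for coinciding roots, Eq. (7.46)."""
    lam1, lam2 = characteristic_roots(a, b)
    if lam1 == lam2:
        return lambda t: (c1 + c2 * t) * cmath.exp(lam1 * t)
    return lambda t: c1 * cmath.exp(lam1 * t) + c2 * cmath.exp(lam2 * t)

distinct = characteristic_roots(3.0, 2.0)    # real, different roots
imaginary = characteristic_roots(0.0, 4.0)   # purely imaginary roots
x_repeated = general_solution(2.0, 1.0, 1.0, 1.0)  # coinciding roots
print(distinct, imaginary, x_repeated(1.0))
```

The second call corresponds to an equation of the form (7.7) with $\omega_0 = 2$: its roots are purely imaginary, which is the case treated in Sec. 7.5.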

Assume that the coefficients a and b are real, while the function
in the right-hand side of Eq. (7.34) is complex. Writing this
function in the form $f(t) + i\varphi(t)$, we arrive at the equation:

$$\ddot{z} + a\dot{z} + bz = f(t) + i\varphi(t) \tag{7.47}$$

(we have used the symbol z to denote the required function). The
solution of this equation will evidently be complex. Writing the
solution in the form z(t) = x(t) + iy(t), we shall introduce it
into Eq. (7.47). The result will be

$$\ddot{x} + i\ddot{y} + a\dot{x} + ai\dot{y} + bx + biy = f(t) + i\varphi(t) \tag{7.48}$$

When complex numbers are equal to each other, their real and
imaginary parts equal each other separately [see Eq. (7.21)]. Hence,
Eq. (7.48) breaks up into two separate equations:

$$\ddot{x} + a\dot{x} + bx = f(t), \quad \ddot{y} + a\dot{y} + by = \varphi(t) \tag{7.49}$$

the first of which coincides with Eq. (7.34). This property of Eq. (7.48)
allows us to use the following procedure that sometimes facilitates
calculations quite significantly. Let us assume that the right-hand
side of Eq. (7.34) we are solving is real. By adding an arbitrary
imaginary function to it, we reduce the equation to the form of
Eq. (7.47). After next finding the complex solution of the equation,
we take its real part. It will be the solution of the initial equation
[Eq. (7.34)].
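As an illustration of this procedure, take the real right-hand side $f_0 \cos \omega t$ and add the imaginary function $i f_0 \sin \omega t$, so that the right-hand side becomes $f_0 e^{i\omega t}$. The sketch below assumes a trial solution of the form $z = Z e^{i\omega t}$ (this trial form and all names and sample values are assumptions, not taken from the text), which gives $Z = f_0 / (b - \omega^2 + i a \omega)$. The real part of z is then checked against the original equation by finite differences.

```python
import cmath
import math

def complex_amplitude(a, b, f0, omega):
    # From substituting z = Z*exp(i*omega*t) into z'' + a*z' + b*z = f0*exp(i*omega*t)
    return f0 / (b - omega**2 + 1j * a * omega)

def real_solution(a, b, f0, omega):
    """Real part of the complex solution: a partial solution of the real equation."""
    Z = complex_amplitude(a, b, f0, omega)
    return lambda t: (Z * cmath.exp(1j * omega * t)).real

def residual(x, a, b, f0, omega, t, h=1e-3):
    """x'' + a*x' + b*x - f0*cos(omega*t), with derivatives by central differences."""
    xdd = (x(t + h) - 2.0 * x(t) + x(t - h)) / h**2
    xd = (x(t + h) - x(t - h)) / (2.0 * h)
    return xdd + a * xd + b * x(t) - f0 * math.cos(omega * t)

x = real_solution(0.5, 4.0, 1.0, 1.5)
check = residual(x, 0.5, 4.0, 1.0, 1.5, 0.7)
print(check)
```

The residual differs from zero only by the finite-difference error, so the real part of the complex solution does satisfy the real equation.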


7.5. Harmonic Oscillations

Let us consider oscillations described by the equation

$$\ddot{x} + \omega_0^2 x = 0 \tag{7.7}$$

Such oscillations are performed by a body of mass m experiencing
only the quasi-elastic force $F = -kx$. The coefficient of x in
Eq. (7.7) has the value

$$\omega_0^2 = \frac{k}{m} \tag{7.6}$$

Using the expression $x = \exp(\lambda t)$ [see Eq. (7.42)] in Eq. (7.7),
we arrive at the characteristic equation

$$\lambda^2 + \omega_0^2 = 0 \tag{7.50}$$

This equation has imaginary roots: $\lambda_1 = +i\omega_0$ and
$\lambda_2 = -i\omega_0$.

According to Eq. (7.45), the general solution of Eq. (7.7) has the
form

$$x = C_1 e^{i\omega_0 t} + C_2 e^{-i\omega_0 t} \tag{7.51}$$

where $C_1$ and $C_2$ are complex constants.

The function x(t) describing the oscillation must be real. For
this end, the coefficients $C_1$ and $C_2$ in Eq. (7.51) must be selected
so as to observe the condition [see Eq. (7.23)]:

$$C_1^* e^{-i\omega_0 t} + C_2^* e^{i\omega_0 t} = C_1 e^{i\omega_0 t} + C_2 e^{-i\omega_0 t} \tag{7.52}$$

[we have equated expression (7.51) to its complex conjugate]. Equation
(7.52) will be obeyed if $C_1 = C_2^*$ (in this case $C_2 = C_1^*$). Let
us write the coefficients $C_1$ and $C_2$ satisfying this condition in the
exponential form [see Eq. (7.27)], denoting their modulus by A/2
and the argument of $C_1$ by $\alpha$:

$$C_1 = \frac{A}{2} e^{i\alpha}, \quad C_2 = \frac{A}{2} e^{-i\alpha} \tag{7.53}$$

Introduction of these expressions into Eq. (7.51) yields

$$x = \frac{A}{2} \left[ e^{i(\omega_0 t + \alpha)} + e^{-i(\omega_0 t + \alpha)} \right] = A \cos(\omega_0 t + \alpha) \tag{7.54}$$

[see Eq. (7.26)]. Thus, the general solution of Eq. (7.7) is

$$x = A \cos(\omega_0 t + \alpha) \tag{7.55}$$

where A and $\alpha$ are arbitrary constants*.

* The solution of Eq. (7.7) can be written in two other ways. Let us
transform Eq. (7.55) according to the formula for the cosine of a sum:

$$x = A (\cos \alpha \cos \omega_0 t - \sin \alpha \sin \omega_0 t)$$

and introduce the notation

$$c_1 = A \cos \alpha, \quad c_2 = -A \sin \alpha$$

The function x(t) can therefore be written in the form

$$x = c_1 \cos \omega_0 t + c_2 \sin \omega_0 t$$

where $c_1$ and $c_2$ are arbitrary constants.

Finally, using Eq. (7.24), we can write Eq. (7.55) as follows:

$$x = \operatorname{Re} \{ A e^{i(\omega_0 t + \alpha)} \}$$

Thus, the displacement x changes with time according to a cosine
law. Consequently, the motion of the system experiencing the action
of a force of the kind $F = -kx$ is a harmonic oscillation.

A graph of the harmonic oscillation, i.e. of the function (7.55),
is shown in Fig. 7.4. The time t is laid off along the horizontal
axis, and the displacement x along the vertical one. Since a
cosine varies from $-1$ to $+1$, the values of x range from
$-A$ to $+A$.

Fig. 7.4

The maximum deviation of a system from its equilibrium position
is called the amplitude of oscillation. The amplitude A is
a constant positive quantity. Its value is determined by the
magnitude of the initial deviation or push that brought the system
out of the equilibrium position.

The cosine argument $(\omega_0 t + \alpha)$ is called the phase of
oscillation. The constant $\alpha$ is the value of the phase at the moment
t = 0 and is called the initial phase of oscillation. The constant
$\alpha$ will change when the moment from which we begin to measure the
time is changed. Hence, the value of the initial phase is determined
by when we begin to measure the time. Since the value of x does not
change when a whole number of $2\pi$'s is added to or subtracted from
the phase, we can always ensure that the initial phase will be less
than $\pi$ in magnitude. This is why only values of $\alpha$ within the
limits from $-\pi$ to $+\pi$ are usually considered.

Since a cosine is a periodic function with the period $2\pi$, different
states* of a system performing harmonic oscillations repeat during
the time interval T in which the phase of oscillation receives an
increment equal to $2\pi$ (Fig. 7.4). The interval T is called the period
of oscillation. It can be found from the condition
$\omega_0 (t + T) + \alpha = (\omega_0 t + \alpha) + 2\pi$, whence

$$T = \frac{2\pi}{\omega_0} \tag{7.56}$$

* We remind our reader that the state of a mechanical system is
characterized by the values of the coordinates and the velocities of
the bodies forming the system.

The number of oscillations in unit time is called the frequency
of oscillation $\nu$. It is quite evident that the frequency $\nu$ is
related to the duration of one oscillation T by the expression

$$\nu = \frac{1}{T} \tag{7.57}$$

The unit of frequency is the frequency of oscillations whose period
is 1 s. This unit is called the hertz (Hz). A frequency of $10^3$ Hz is
called a kilohertz (kHz), and of $10^6$ Hz a megahertz (MHz).

It follows from Eq. (7.56) that

$$\omega_0 = \frac{2\pi}{T} \tag{7.58}$$

Thus, $\omega_0$ is the number of oscillations in $2\pi$ seconds. The
quantity $\omega_0$ is called the circular or cyclic frequency. It is
related to the conventional frequency $\nu$ by the expression

$$\omega_0 = 2\pi\nu \tag{7.59}$$

Time differentiation of Eq. (7.55) yields an expression for the
velocity:

$$v = \dot{x} = -A\omega_0 \sin(\omega_0 t + \alpha) = A\omega_0 \cos\left(\omega_0 t + \alpha + \frac{\pi}{2}\right) \tag{7.60}$$

Examination of Eq. (7.60) shows that the velocity also changes
according to a harmonic law, the amplitude of the velocity being
$A\omega_0$. It follows from a comparison of Eqs. (7.55) and (7.60) that
the phase of the velocity is in advance of that of the displacement
by $\pi/2$.

Time differentiation of Eq. (7.60) yields an expression for the
acceleration:

$$a = \ddot{x} = -A\omega_0^2 \cos(\omega_0 t + \alpha) = A\omega_0^2 \cos(\omega_0 t + \alpha + \pi) \tag{7.61}$$

It can be seen from Eq. (7.61) that the acceleration and the
displacement are opposite in phase. This signifies that when the
displacement reaches its maximum positive value, the acceleration
reaches its maximum negative value, and vice versa.

Figure 7.5 compares graphs for the displacement, velocity, and 
acceleration. 

A particular oscillation is characterized by definite values of the
amplitude A and the initial phase $\alpha$. The values of these quantities
for a given oscillation can be determined from the initial conditions,
i.e. from the values of the deviation $x_0$ and the velocity $v_0$ at the
initial moment. Indeed, assuming in Eqs. (7.55) and (7.60) that
t = 0, we get two equations:

$$x_0 = A \cos \alpha, \quad v_0 = -A\omega_0 \sin \alpha$$

from which we find that

$$A = \sqrt{x_0^2 + \frac{v_0^2}{\omega_0^2}} \tag{7.62}$$

$$\tan \alpha = -\frac{v_0}{x_0 \omega_0} \tag{7.63}$$

Equation (7.63) is satisfied by two values of $\alpha$ within the interval
from $-\pi$ to $+\pi$. That value must be taken which gives the correct
signs of cosine and sine.
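In a computation, the choice between the two values of $\alpha$ can be delegated to the two-argument arctangent, which uses the signs of both the sine and the cosine. The following sketch is an illustration only; the function name and the sample initial conditions are assumptions.

```python
import math

def amplitude_and_phase(x0, v0, omega0):
    """Return (A, alpha) from x0 = A*cos(alpha), v0 = -A*omega0*sin(alpha)."""
    A = math.hypot(x0, v0 / omega0)        # Eq. (7.62)
    # atan2(sin-part, cos-part) picks the quadrant with the correct signs;
    # its result lies between -pi and +pi.
    alpha = math.atan2(-v0 / omega0, x0)
    return A, alpha

A1, alpha1 = amplitude_and_phase(1.0, -2.0, 2.0)
A2, alpha2 = amplitude_and_phase(-1.0, 2.0, 2.0)
print(A1, alpha1, A2, alpha2)
```

Both sets of initial conditions give the same value of $\tan \alpha$ in Eq. (7.63), yet the initial phases differ by $\pi$: the first is $\pi/4$ and the second is $-3\pi/4$.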

A quasi-elastic force is conservative. Therefore, the total energy
of a harmonic oscillation must remain constant. In the course of
oscillations, kinetic energy transforms into potential energy, and
vice versa. At the moments of maximum deviation from the equilibrium
position, the total energy E consists only of potential energy, which
reaches its maximum value $E_{p,\max}$:

$$E = E_{p,\max} = \frac{kA^2}{2} \tag{7.64}$$

When the system passes through its equilibrium position, the total
energy consists only of kinetic energy, which at these moments reaches
its maximum value $E_{k,\max}$:

$$E = E_{k,\max} = \frac{m v_{\max}^2}{2} = \frac{m A^2 \omega_0^2}{2} \tag{7.65}$$

(it was shown above that the velocity amplitude is $A\omega_0$).
Equations (7.64) and (7.65) equal each other because by Eq. (7.6) we
have $m\omega_0^2 = k$.

Let us see how the kinetic and potential energies of harmonic
oscillations change with time. The kinetic energy is [see Eq. (7.60)
for $\dot{x}$]

$$E_k = \frac{m\dot{x}^2}{2} = \frac{m A^2 \omega_0^2}{2} \sin^2(\omega_0 t + \alpha) \tag{7.66}$$

Fig. 7.5
The potential energy is expressed by the equation

$$E_p = \frac{kx^2}{2} = \frac{kA^2}{2} \cos^2(\omega_0 t + \alpha) \tag{7.67}$$

Adding Eqs. (7.66) and (7.67) and bearing in mind that
$m\omega_0^2 = k$, we get a formula for the total energy:

$$E = E_k + E_p = \frac{kA^2}{2} = \frac{m A^2 \omega_0^2}{2} \tag{7.68}$$

[compare with Eqs. (7.64) and (7.65)]. Thus, the total energy of
a harmonic oscillation is indeed constant.

Using formulas of trigonometry, we can write the expressions for
$E_k$ and $E_p$ as follows:

$$E_k = E \sin^2(\omega_0 t + \alpha) = E \left[ \tfrac{1}{2} - \tfrac{1}{2} \cos 2(\omega_0 t + \alpha) \right] \tag{7.69}$$

$$E_p = E \cos^2(\omega_0 t + \alpha) = E \left[ \tfrac{1}{2} + \tfrac{1}{2} \cos 2(\omega_0 t + \alpha) \right] \tag{7.70}$$

where E is the total energy of the system. A glance at these equations
shows that $E_k$ and $E_p$ change with a frequency of $2\omega_0$, i.e.
with a frequency twice that of the harmonic oscillations. Figure 7.6
compares graphs for x, $E_k$, and $E_p$.

It is known that the mean value of sine square and of cosine square
equals one-half. Hence, the mean value of $E_k$ coincides with that
of $E_p$ and equals E/2.
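These statements about the energy are easy to confirm numerically. The sketch below is an illustration under assumed sample values of m, k, A, and $\alpha$; the function names are hypothetical.

```python
import math

def energies(m, k, A, alpha, t):
    """Kinetic and potential energy at time t, Eqs. (7.66) and (7.67)."""
    omega0 = math.sqrt(k / m)                          # Eq. (7.6)
    x = A * math.cos(omega0 * t + alpha)               # Eq. (7.55)
    v = -A * omega0 * math.sin(omega0 * t + alpha)     # Eq. (7.60)
    return 0.5 * m * v**2, 0.5 * k * x**2

def mean_over_period(m, k, A, alpha, samples=1000):
    """Average E_k and E_p over one period using equally spaced samples."""
    T = 2.0 * math.pi * math.sqrt(m / k)               # Eq. (7.56)
    sum_k = sum_p = 0.0
    for i in range(samples):
        e_k, e_p = energies(m, k, A, alpha, i * T / samples)
        sum_k += e_k
        sum_p += e_p
    return sum_k / samples, sum_p / samples

m, k, A, alpha = 0.5, 8.0, 0.1, 0.3      # assumed sample values (SI units)
total = 0.5 * k * A**2                   # Eq. (7.68)
e_k, e_p = energies(m, k, A, alpha, 1.234)
mean_k, mean_p = mean_over_period(m, k, A, alpha)
print(total, e_k + e_p, mean_k, mean_p)
```

At any moment the two energies add up to $kA^2/2$, and each of them averages to one-half of that value over a period.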





7.6. The Pendulum 

Physicists understand a pendulum to be a rigid body performing 
oscillations about a fixed point or axis under the action of the force 
of gravity. 

A mathematical or simple pendulum is defined as an idealized 
system consisting of a weightless and unstretchable string on which 
a mass concentrated at one point is suspended. A sufficiently close 
approximation to a simple pendulum is a small heavy sphere sus- 
pended on a long thin thread. 
We shall characterize the deviation of a pendulum from its
equilibrium position by the angle $\varphi$ made by the thread with
a vertical line (Fig. 7.7). Deviation of a pendulum from its
equilibrium position is attended by the appearance of a rotational
moment (torque) M whose magnitude is $mgl \sin \varphi$ (here m is the
mass and l is the length of the pendulum). Its direction is such that
it tends to return the pendulum to its equilibrium position, and is
similar in this respect to a quasi-elastic force. Therefore, opposite
signs must be assigned to the moment M and the angular displacement
$\varphi$*, as is done to the displacement and the quasi-elastic force.
Hence, the expression for the rotational moment has the form

$$M = -mgl \sin \varphi \tag{7.71}$$

Fig. 7.7

Let us write an equation for the dynamics of rotation of a pendulum.
Denoting the angular acceleration by $\ddot{\varphi}$ and taking into
account that the moment of inertia of a pendulum is $ml^2$, we get

$$ml^2 \ddot{\varphi} = -mgl \sin \varphi$$

This equation can be transformed as follows:

$$\ddot{\varphi} + \frac{g}{l} \sin \varphi = 0 \tag{7.72}$$


Let us consider only small-amplitude oscillations. We can thus
assume that $\sin \varphi \approx \varphi$. Introducing, in addition, the
notation

$$\omega_0^2 = \frac{g}{l} \tag{7.73}$$

we arrive at the equation

$$\ddot{\varphi} + \omega_0^2 \varphi = 0 \tag{7.74}$$

similar to Eq. (7.7). Its solution has the form

$$\varphi = A \cos(\omega_0 t + \alpha) \tag{7.75}$$

Consequently, in small-amplitude oscillations, the angular
displacement of a simple pendulum changes with time according to
a harmonic law.

* Considering $\varphi$ as a vector related to the direction of rotation
by the right-hand screw rule (this is permissible at small values of
$\varphi$), the opposite signs of M and $\varphi$ can be explained by the
fact that the vectors M and $\varphi$ have opposite directions (Fig. 7.7).







Equation (7.73) shows that the frequency of oscillations of a simple
pendulum depends only on its length and on the acceleration of
the force of gravity and does not depend on the mass of the pendulum.
Equation (7.56) after Eq. (7.73) is introduced into it gives
the expression for the period of oscillations of a simple pendulum
known from school days:

$$T = 2\pi \sqrt{\frac{l}{g}} \tag{7.76}$$

By solving Eq. (7.72), we can obtain the following formula for
the period of oscillations:

$$T = 2\pi \sqrt{\frac{l}{g}} \left\{ 1 + \left(\frac{1}{2}\right)^2 \sin^2 \frac{A}{2} + \left(\frac{1}{2} \times \frac{3}{4}\right)^2 \sin^4 \frac{A}{2} + \ldots \right\} \tag{7.77}$$

where A is the amplitude of the oscillations, i.e. the greatest angle
through which a pendulum deflects from its equilibrium position.
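To get a feeling for the size of the correction in Eq. (7.77), the series can be summed numerically. In the sketch below the default of three terms reproduces exactly the terms written out in Eq. (7.77); the rule used to generate further coefficients, the value g = 9.81 m/s², and the function names are assumptions made for illustration.

```python
import math

def period_small_angle(l, g=9.81):
    """Eq. (7.76): T = 2*pi*sqrt(l/g)."""
    return 2.0 * math.pi * math.sqrt(l / g)

def period_series(l, amplitude, g=9.81, terms=3):
    """Eq. (7.77) truncated after `terms` terms; amplitude in radians."""
    s = math.sin(amplitude / 2.0) ** 2
    total, coeff = 0.0, 1.0
    for n in range(terms):
        total += coeff**2 * s**n
        # Next coefficient: 1, 1/2, (1/2)(3/4), ... (assumed continuation)
        coeff *= (2 * n + 1) / (2 * n + 2)
    return period_small_angle(l, g) * total

ratio = period_series(1.0, math.radians(20.0)) / period_small_angle(1.0)
print(ratio)
```

For an amplitude of 20 degrees the period exceeds the small-amplitude value (7.76) by less than one per cent.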

If an oscillating body cannot be treated as a point particle, the
pendulum is called a physical one. When the pendulum deviates from
its equilibrium position by the angle $\varphi$, a rotational moment
(torque) appears that tends to return the pendulum to its equilibrium
position. This moment is

$$M = -mgl \sin \varphi \tag{7.78}$$

where m is the mass of the pendulum and l is the distance between the
suspension point O and the centre of mass C of the pendulum (Fig. 7.8).
The minus sign has the same meaning as in Eq. (7.71).

Denoting the moment of inertia of a pendulum relative to the axis
passing through the point of suspension by the symbol I, we can write:

$$I\ddot{\varphi} = -mgl \sin \varphi \tag{7.79}$$

For small-amplitude oscillations, Eq. (7.79) transforms into
Eq. (7.74) that we already know:

$$\ddot{\varphi} + \omega_0^2 \varphi = 0$$

Here $\omega_0^2$ stands for the following quantity:

$$\omega_0^2 = \frac{mgl}{I} \tag{7.80}$$
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It follows from Eqs. (7.74) and (7.80) that with small displace- 
ments from the equilibrium position, a physical pendulum performs 
harmonic oscillations whose frequency depends on the mass of the 
pendulum, the moment of inertia of the pendulum relative to the 
axis of rotation, and the distance between the latter and the centre 
of mass of the pendulum. According to Eq. (7.80), the period of 
oscillation of a physical pendulum is determined by the expression 

' <™*) 


A comparison of Eqs. (7.76) and (7.81) shows that a mathematical 
pendulum of length 


lr = 


f_ 

ml 


(7.82) 


will have the same period of oscillations as the given physical 
pendulum. The quantity (7.82) is called the reduced length of the 
physical pendulum. Thus, the reduced length of a physical pendu- 
lum is the length of a simple pendulum whose period of oscillations 
coincides with that of the given physical pendulum. 
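
A minimal sketch of Eqs. (7.81) and (7.82) for one concrete body may help here. The example body (a uniform thin rod pivoted at one end), its mass and length, and the value of g are illustrative assumptions, not data from the text.

```python
import math

G = 9.81  # m/s^2, assumed value of the acceleration of free fall


def physical_pendulum_period(inertia, mass, l, g=G):
    """Eq. (7.81): T = 2*pi*sqrt(I / (m*g*l))."""
    return 2.0 * math.pi * math.sqrt(inertia / (mass * g * l))


def reduced_length(inertia, mass, l):
    """Eq. (7.82): l_r = I / (m*l)."""
    return inertia / (mass * l)


def simple_pendulum_period(length, g=G):
    """Period of a mathematical pendulum, T = 2*pi*sqrt(l / g)."""
    return 2.0 * math.pi * math.sqrt(length / g)


# Hypothetical example: uniform thin rod of length L suspended by its end.
ROD_MASS, ROD_LENGTH = 0.5, 1.2                   # kg, m (illustrative)
ROD_INERTIA = ROD_MASS * ROD_LENGTH ** 2 / 3.0    # about the axis at the end
ROD_L = ROD_LENGTH / 2.0                          # suspension point to centre of mass
```

For this rod the reduced length comes out as two thirds of the rod length, and a simple pendulum of that length has the same period as the rod.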

The point on the straight line joining the point of suspension 
to the centre of mass at a distance of the reduced length from the 
axis of rotation is called the centre of oscillation of the physical 
pendulum (see point O' in Fig. 7.8). It can be shown (we invite our
reader to do this as an exercise) that when a pendulum is suspended
by its centre of oscillation O', its reduced length and, consequently,
its period of oscillations will be the same as initially. Hence, the
point of suspension and the centre of oscillation are interchangeable: 
when the point of suspension is transferred to the centre of oscilla- 
tion, the previous point of suspension becomes the new centre of 
oscillation. This property underlies the determination of the acce- 
leration of free fall with the aid of the so-called reversible pendulum. 
The latter has two parallel knife edges fastened near its ends by which 
it can be suspended in turn. Heavy weights can be moved along 
the pendulum and be fastened to it. The weights are adjusted so
that the pendulum has the same period of oscillations when
suspended by either of the knife edges. In this case, the distance
between the knife edges will be l_r. By measuring the period of oscillations
T of the pendulum and knowing l_r, we can find the acceleration
of free fall g by the equation

g = \frac{4\pi^2 l_r}{T^2}

7.7. Vector Diagram

The solution of a number of problems, particularly the addition 
of several oscillations of the same direction (or, which is the same, 
the addition of several harmonic functions) is considerably facilitated
and becomes clear if we depict oscillations graphically in the
form of vectors in a plane. The result
obtained is called a vector diagram.

Let us take an axis which we shall
denote by the symbol x (Fig. 7.9). From
point O on the axis we shall lay off a
vector of length A making the angle α with
the axis. If we rotate this vector with the
angular velocity ω₀, then the projection of
the end of the vector will move along the
x-axis within the limits from −A to +A.

The coordinate of this projection will change with time according
to the law

x = A \cos(\omega_0 t + \alpha)

Consequently, the projection of the tip of the vector onto the axis
will perform a harmonic oscillation with an amplitude equal to the 
length of the vector, an angular frequency equal to the angular velo- 
city of the vector, and an initial phase equal to the angle formed 
by the vector with the axis at the initial moment. 

It follows from the above that a harmonic oscillation can be given 
with the aid of a vector whose length equals the amplitude of the 
oscillation, while the direction of the vector makes an angle with 
the x-axis that equals the initial phase of the oscillation. 

Let us consider the addition of two harmonic oscillations of the 
same direction and the same frequency. The displacement x of the 
oscillating body will be the sum of the displacements x₁ and x₂,
which can be written as follows:

x_1 = A_1 \cos(\omega_0 t + \alpha_1), \quad x_2 = A_2 \cos(\omega_0 t + \alpha_2)    (7.83)

Let us represent both oscillations with the aid of the vectors A₁
and A₂ (Fig. 7.10). We shall construct the resultant vector A according
to the rules of vector addition. It is easy to see that the projection
of this vector onto the x-axis equals the sum of the projections
of the vectors being added:

x = x_1 + x_2

Consequently, the vector A represents the resultant oscillation. This vector
rotates with the same angular velocity ω₀ as the vectors A₁ and A₂,
so that the resultant motion will be a harmonic oscillation with
the frequency ω₀, amplitude A, and the initial phase α. It can be
seen from the construction that

A^2 = A_1^2 + A_2^2 - 2A_1A_2 \cos[\pi - (\alpha_2 - \alpha_1)] = A_1^2 + A_2^2 + 2A_1A_2 \cos(\alpha_2 - \alpha_1)    (7.84)

\tan\alpha = \frac{A_1 \sin\alpha_1 + A_2 \sin\alpha_2}{A_1 \cos\alpha_1 + A_2 \cos\alpha_2}    (7.85)

Fig. 7.9


Thus, the representation of harmonic oscillations by means of 
vectors makes it possible to reduce the addition of several oscillations 
to the operation of vector addition. This procedure is especially 

useful in optics, for example, where the 
light oscillations at a point are deter- 
mined as the result of the superposition 
of many oscillations arriving at the given 
point from different sections of a wave- 
front. 

Equations (7.84) and (7.85) can natu- 
rally be obtained by summation of Eqs. 
(7.83) and the corresponding trigonomet- 
ric transformations. But the way we 
have used to obtain these equations is 
distinguished by its great simplicity and 
clarity. 
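
The vector construction can be sketched numerically as follows. The first function applies Eqs. (7.84) and (7.85) directly; the second adds the two vectors as complex numbers, which is one convenient way (chosen here, not prescribed by the text) of doing plane vector addition. The function names are illustrative.

```python
import cmath
import math


def add_oscillations(a1, alpha1, a2, alpha2):
    """Amplitude and initial phase of the sum from Eqs. (7.84) and (7.85)."""
    amplitude = math.sqrt(a1 ** 2 + a2 ** 2
                          + 2.0 * a1 * a2 * math.cos(alpha2 - alpha1))
    # atan2 selects the quadrant that the bare tangent in (7.85) leaves open.
    phase = math.atan2(a1 * math.sin(alpha1) + a2 * math.sin(alpha2),
                       a1 * math.cos(alpha1) + a2 * math.cos(alpha2))
    return amplitude, phase


def add_as_vectors(a1, alpha1, a2, alpha2):
    """The same sum obtained by adding the two vectors as complex numbers."""
    total = cmath.rect(a1, alpha1) + cmath.rect(a2, alpha2)
    return abs(total), cmath.phase(total)
```

Both routes return the same amplitude and phase, and the resulting single cosine reproduces x₁ + x₂ at any moment of time.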

Let us analyse Eq. (7.84) for the
amplitude. If the difference between
the phases of both oscillations α₂ − α₁ vanishes, the amplitude
of the resulting oscillation equals the sum of A₁ and A₂. If the
phase difference α₂ − α₁ equals +π or −π, i.e. both oscillations
are in counterphase, then the amplitude of the resulting oscillation
equals |A₁ − A₂|.

If the frequencies of the oscillations x₁ and x₂ are not the same,
the vectors A₁ and A₂ will rotate with different velocities. In this
case, the resultant vector A pulsates in magnitude and rotates with 
a varying velocity. Consequently, the resultant motion in this case 
will be a complex oscillating process instead of a harmonic oscilla- 
tion. 



7.8. Beats 

Of special interest is the case when two harmonic oscillations 
of the same direction being added differ only slightly in frequency. 
We shall now show that the resultant motion in these conditions 
can be considered as a harmonic oscillation with a pulsating ampli- 
tude. Such oscillations are called beats. 

Let ω stand for the frequency of one of the oscillations and ω + Δω
for that of the second one. According to our conditions,
Δω ≪ ω. We shall assume that the amplitudes of both oscillations
are the same and equal A. To avoid unnecessary complications in
our formulas, we shall consider that the initial phases of both oscillations
equal zero. The equations of the oscillations will thus become

x_1 = A \cos\omega t, \quad x_2 = A \cos(\omega + \Delta\omega)t

By summing these expressions and using the trigonometric formula
for the sum of cosines, we get

x = x_1 + x_2 = \left( 2A \cos\frac{\Delta\omega}{2} t \right) \cos\omega t    (7.86)

(in the second multiplier we disregard the term Δω/2 in comparison
with ω). A graph of the function (7.86) is shown in Fig. 7.11a. The
graph has been plotted for ω/Δω = 10.



The multiplier in parentheses in Eq. (7.86) changes much more
slowly than the second multiplier. Owing to the condition Δω ≪ ω,
the multiplier in parentheses virtually does not change during the
time in which the multiplier cos ωt performs several complete
oscillations. This gives us the grounds to consider the oscillation (7.86)
as a harmonic oscillation of frequency ω whose amplitude changes
according to a periodic law. The multiplier in parentheses cannot
be an expression of this law because it changes within the limits
from −2A to +2A whereas the amplitude by definition is a positive
quantity. A graph of the amplitude is shown in Fig. 7.11b. The
analytic expression of the amplitude obviously has the form

\text{amplitude} = \left| 2A \cos\frac{\Delta\omega}{2} t \right|    (7.87)

The function (7.87) is a periodic function with a frequency double
that of the expression inside the magnitude sign (see Fig. 7.12
comparing graphs of the cosine and its magnitude), i.e. with a frequency
of Δω. Thus, the frequency of pulsations of the amplitude, called
the frequency of the beats, equals the difference between the frequencies
of the oscillations being added.

We must note that the multiplier 2A cos(Δω t/2) not only determines
the amplitude, but also affects the phase of the oscillations. This
is manifested, for example, in that the deflections corresponding to
adjacent peaks of the amplitude have opposite signs (see points M₁
and M₂ in Fig. 7.11a).

Fig. 7.12
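
The beat formulas of this section can be checked with a short sketch. It compares the direct sum x₁ + x₂ with the exact product form from the sum-of-cosines identity, and evaluates the amplitude (7.87) and the period of the beats 2π/Δω. The function names are illustrative and are not taken from the text.

```python
import math


def beat_sum(a, omega, d_omega, t):
    """Direct sum x1 + x2 of the two oscillations being added."""
    return a * math.cos(omega * t) + a * math.cos((omega + d_omega) * t)


def beat_product(a, omega, d_omega, t):
    """Exact product form; Eq. (7.86) then replaces omega + d_omega/2 by omega."""
    return 2.0 * a * math.cos(d_omega * t / 2.0) * math.cos((omega + d_omega / 2.0) * t)


def beat_amplitude(a, d_omega, t):
    """Eq. (7.87): the slowly pulsating amplitude."""
    return abs(2.0 * a * math.cos(d_omega * t / 2.0))


def beat_period(d_omega):
    """Period of the amplitude pulsations; the beat frequency is d_omega."""
    return 2.0 * math.pi / d_omega
```

The sum never exceeds the amplitude (7.87) in magnitude, and the amplitude repeats after 2π/Δω, half the period of the cosine inside the magnitude sign.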


7.9. Addition of Mutually Perpendicular Oscillations

Assume that a point particle can oscillate both along the x-axis
and along the y-axis perpendicular to it. If we induce both oscilla- 
tions, the particle will move along a certain, generally speaking, 
curved trajectory whose shape depends on the phase difference be- 
tween the two oscillations. 

Let us choose the beginning of counting time so that the initial 
phase of the first oscillation equals zero. The equations of the oscil- 
lations will therefore be written as follows: 

x = A \cos\omega t, \quad y = B \cos(\omega t + \alpha)    (7.88)

where α is the difference between the phases of the two oscillations.

Equations (7.88) describe the trajectory along which a body participating
in both oscillations moves and are given in the parametric
form. To obtain an equation of the trajectory in the conventional
form, we must exclude the parameter t from Eqs. (7.88). It follows
from the first of the Eqs. (7.88) that

\cos\omega t = \frac{x}{A}    (7.89)

Hence,

\sin\omega t = \pm\sqrt{1 - \frac{x^2}{A^2}}    (7.90)

Now let us expand the cosine in the second of the Eqs. (7.88) according
to the formula for the cosine of a sum, using instead of cos ωt
and sin ωt their values from Eqs. (7.89) and (7.90). As a result we get

\frac{y}{B} = \frac{x}{A} \cos\alpha \mp \sin\alpha \sqrt{1 - \frac{x^2}{A^2}}

This equation after simple transformations can be given the form

\frac{x^2}{A^2} + \frac{y^2}{B^2} - \frac{2xy}{AB} \cos\alpha = \sin^2\alpha    (7.91)


It is known from analytical geometry that Eq. (7.91) is the equa- 
tion of an ellipse whose axes are oriented arbitrarily relative to the 
coordinate axes x and y. The orientation of the ellipse and the dimensions
of its semiaxes depend in a quite complicated way on the
amplitudes A and B and the phase difference α.
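
That every point of the parametric trajectory (7.88) satisfies Eq. (7.91) can be confirmed numerically. The sketch below is only a check of the algebra; the function names are illustrative, and the argument `phase` stands for the product ωt.

```python
import math


def trajectory_point(a, b, alpha, phase):
    """Point (x, y) of Eqs. (7.88) at the phase omega*t = `phase`."""
    return a * math.cos(phase), b * math.cos(phase + alpha)


def ellipse_residual(x, y, a, b, alpha):
    """Left-hand side minus right-hand side of Eq. (7.91)."""
    return ((x / a) ** 2 + (y / b) ** 2
            - 2.0 * x * y / (a * b) * math.cos(alpha)
            - math.sin(alpha) ** 2)
```

The residual vanishes (to rounding error) for any phase difference α, including the special cases 0, ±π and ±π/2 considered next.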

Let us study the shape of the trajectory in some particular cases. 

1. The phase difference α equals zero. In this case, Eq. (7.91)
becomes

\left( \frac{x}{A} - \frac{y}{B} \right)^2 = 0

whence we get an equation of a straight line:

y = \frac{B}{A} x    (7.92)

The oscillating particle moves along this straight line, its distance
from the origin of coordinates being r = √(x² + y²). Introducing
into this equation the expressions (7.88) for x and y and taking into
account that α = 0, we get the law of the change in r with time:

r = \sqrt{A^2 + B^2} \cos\omega t    (7.93)

It follows from Eq. (7.93) that the resultant motion is a harmonic
oscillation along the straight line (7.92) with the frequency ω and
the amplitude √(A² + B²) (Fig. 7.13).

2. The phase difference α equals ±π. Equation (7.91) has the form

\left( \frac{x}{A} + \frac{y}{B} \right)^2 = 0

whence we find that the resultant motion is a harmonic oscillation
along a straight line (Fig. 7.14):

y = -\frac{B}{A} x

3. When α = ±π/2, Eq. (7.91) becomes

\frac{x^2}{A^2} + \frac{y^2}{B^2} = 1

i.e. it becomes the equation of an ellipse reduced to the coordinate
axes, the semiaxes of the ellipse being equal to the corresponding 
amplitudes of the oscillations. When the amplitudes A and B are 
equal, the ellipse degenerates into a circle. 



The cases α = +π/2 and α = −π/2 differ in the direction of
motion along the ellipse or circle. If α = +π/2, Eqs. (7.88) can be
written as follows:

x = A \cos\omega t, \quad y = -B \sin\omega t    (7.95)

At the moment t = 0, the body is at point 1 (Fig. 7.15). At the following
moments, the coordinate x diminishes, while the coordinate
y becomes negative. Consequently, the motion is clockwise.

When α = −π/2, the equations of the oscillations become

x = A \cos\omega t, \quad y = B \sin\omega t    (7.96)

Hence we can conclude that the motion is counterclockwise.

It follows from the above that uniform motion along a circle of 
radius R with the angular velocity ω can be represented as the sum
of two mutually perpendicular oscillations:

x = R \cos\omega t, \quad y = \pm R \sin\omega t    (7.97)

(the plus sign in the expression for y corresponds to counterclockwise
motion, and the minus sign to clockwise motion). 

When the frequencies of mutually perpendicular oscillations differ 
by a very small value Δω, they can be considered as oscillations of
an identical frequency, but with a slowly
changing phase difference. Indeed, the
equations of the oscillations can be
written as follows:

x = A \cos\omega t, \quad y = B \cos[\omega t + (\Delta\omega\, t + \alpha)]

and the expression Δω t + α can be
considered as the phase difference slowly
changing with time according to a
linear law.

The resultant motion in this case
occurs along a slowly changing curve that will sequentially take
on a shape corresponding to all the values of the phase difference
from −π to +π.

If the frequencies of mutually perpendicular oscillations are not identical,
then the trajectory of the resultant motion has the shape of rather intricate
curves called Lissajous figures. Figure 7.16 shows one of the simple trajectories
obtained at a frequency ratio of 1:2 and a phase difference of π/2. The
equations of the oscillations have the form

x = A \cos\omega t, \quad y = B \cos\left( 2\omega t + \frac{\pi}{2} \right)

During the time the particle manages to move from one extreme position to the
other along the x-axis, it will be able to leave its zero position, reach one
extreme position on the y-axis, then the other one, and return to its zero position.

With a frequency ratio of 1:2 and a phase difference of zero, the 
trajectory degenerates into an open curve (Fig. 7.17) along which 
the particle moves to and fro.

The closer to unity is the rational fraction expressing the ratio
of the frequencies of the oscillations, the more intricate is the Lissajous
figure. Figure 7.18 shows as an example a curve for the frequency
ratio of 3 : 4 and the phase difference π/2.
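
A minimal sketch of how points of a Lissajous figure can be generated is given below. It also checks the statement about the frequency ratio 1:2 with zero phase difference: since cos 2u = 2cos²u − 1, the trajectory in that case lies on the open curve y = B(2x²/A² − 1). This explicit equation and the function names are additions for illustration and do not appear in the text.

```python
import math


def lissajous_point(a, b, ratio, alpha, phase):
    """x = a*cos(phase), y = b*cos(ratio*phase + alpha); `phase` stands for omega*t."""
    return a * math.cos(phase), b * math.cos(ratio * phase + alpha)


def open_curve_y(x, a, b):
    """Ratio 1:2, zero phase difference: y = b*(2*(x/a)**2 - 1)."""
    return b * (2.0 * (x / a) ** 2 - 1.0)
```

Passing the points for a full range of phases to any plotting tool reproduces curves of the kind shown in Figs. 7.16 to 7.18.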


7.10. Damped Oscillations 

Damped oscillations are described by Eq. (7.11): 

\ddot{x} + 2\beta\dot{x} + \omega_0^2 x = 0

where, by Eqs. (7.10) and (7.6),

2\beta = \frac{r}{m}, \quad \omega_0^2 = \frac{k}{m}

Here r is the resistance coefficient, i.e. the coefficient of proportionality
between the velocity ẋ and the force of resistance, and k is
the quasi-elastic force coefficient.

We must note that ω₀ is the frequency with which free oscillations
would take place in the absence of resistance of the medium
(when r = 0). This frequency is called the natural frequency of the 
system. 

Introduction of the function x = exp(λt) into Eq. (7.11) leads
to the characteristic equation

\lambda^2 + 2\beta\lambda + \omega_0^2 = 0    (7.98)

The roots of this equation are

\lambda_1 = -\beta + \sqrt{\beta^2 - \omega_0^2}, \quad \lambda_2 = -\beta - \sqrt{\beta^2 - \omega_0^2}    (7.99)

When the damping is not too great (at β < ω₀) the radicand will
be negative. Let us write it in the form (iω)², where ω is a real quantity
equal to

\omega = \sqrt{\omega_0^2 - \beta^2}    (7.100)

Here, the roots of the characteristic equation will be as follows:

\lambda_1 = -\beta + i\omega, \quad \lambda_2 = -\beta - i\omega    (7.101)

By Eq. (7.38), the general solution of Eq. (7.11) will be the function

x = C_1 e^{(-\beta + i\omega)t} + C_2 e^{(-\beta - i\omega)t} = e^{-\beta t} \left( C_1 e^{i\omega t} + C_2 e^{-i\omega t} \right)

The expression in parentheses is similar to Eq. (7.51). It can therefore
be written in a form similar to Eq. (7.55). Thus, when damping
is not too great, the general solution of Eq. (7.11) has the form

x = A_0 e^{-\beta t} \cos(\omega t + \alpha)    (7.102)

Here A₀ and α are arbitrary constants, and ω is a quantity determined
by Eq. (7.100). Figure 7.19 gives a graph of the function
(7.102). The dashed lines show the limits confining the displacement x
of the oscillating particle.

In accordance with the form of the function (7.102), the motion
of the system can be considered as a harmonic oscillation of frequency
ω with an amplitude varying according to the law A(t) =
A₀ exp(−βt). The upper dashed curve in Fig. 7.19 depicts the
function A(t), the quantity A₀ being the amplitude at the initial
moment of time. The initial displacement x₀, apart from A₀, also
depends on the initial phase α: x₀ = A₀ cos α.

The rate of damping of oscillations is determined by the quantity
β = r/2m defined as the damping factor. Let us find the time τ
during which the amplitude diminishes e times. By definition,
exp(−βτ) = exp(−1), whence βτ = 1. Consequently, the damping factor is
the reciprocal of the time interval during which the amplitude
diminishes e times.

According to Eq. (7.56), the period of damped oscillations is 


T = \frac{2\pi}{\sqrt{\omega_0^2 - \beta^2}}    (7.103)

When the resistance of the medium is insignificant, the period of
oscillations virtually equals T₀ = 2π/ω₀. The period of oscillations
grows with an increasing damping factor.

The successive maximum displacements to either side (for example,
A', A'', A''', etc. in Fig. 7.19) form a geometrical progression.
Indeed, if A' = A₀ exp(−βt), then A'' = A₀ exp[−β(t + T)] =
A' exp(−βT), A''' = A₀ exp[−β(t + 2T)] = A'' exp(−βT),
etc. In general, the ratio of the values of the amplitudes corresponding
to moments of time differing by a period is

\frac{A(t)}{A(t + T)} = e^{\beta T}

This ratio is called the damping decrement, and its logarithm is
called the logarithmic decrement:

\lambda = \ln\frac{A(t)}{A(t + T)} = \beta T    (7.104)

[do not confuse with the constant λ in Eqs. (7.98) and (7.101)].

To characterize an oscillatory system, the logarithmic decrement λ
is usually used. Expressing β through λ and T in accordance with
Eq. (7.104), we can write the law of diminishing of the amplitude
with time in the form

A = A_0 \exp\left( -\frac{\lambda}{T} t \right)    (7.105)

In the interval τ during which the amplitude diminishes e times,
the system manages to complete N_e = τ/T oscillations. We find
from the condition exp(−λτ/T) = exp(−1) that λτ/T = λN_e = 1.
Hence the logarithmic decrement is the reciprocal of the number of
oscillations completed during the interval in which the amplitude
diminishes e times.

An oscillatory system is often also characterized by the quantity 


Q = \frac{\pi}{\lambda} = \pi N_e    (7.106)

called the quality, or simply the Q, of the system. As can be seen
from its definition, the quality is proportional to the number of
oscillations N_e performed by the system in the interval τ during
which the amplitude of the oscillations diminishes e times.
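
The chain of definitions in this section (β, ω₀, ω, T, λ, N_e, Q) can be gathered in one short sketch. The function name, the dictionary keys, and the sample values of m, r and k used with it are hypothetical; the formulas are those of Eqs. (7.100), (7.103), (7.104) and (7.106).

```python
import math


def damped_parameters(m, r, k):
    """Characteristics of a weakly damped oscillator with mass m,
    resistance coefficient r and quasi-elastic force coefficient k."""
    beta = r / (2.0 * m)                        # damping factor, 2*beta = r/m
    omega0 = math.sqrt(k / m)                   # natural frequency
    if beta >= omega0:
        raise ValueError("beta >= omega0: the motion is not oscillatory")
    omega = math.sqrt(omega0 ** 2 - beta ** 2)  # Eq. (7.100)
    period = 2.0 * math.pi / omega              # Eq. (7.103)
    decrement = beta * period                   # logarithmic decrement, Eq. (7.104)
    n_e = 1.0 / decrement                       # oscillations per e-fold decay
    quality = math.pi / decrement               # Eq. (7.106)
    return {"beta": beta, "omega0": omega0, "omega": omega,
            "T": period, "lambda": decrement, "N_e": n_e, "Q": quality}
```

For instance, damped_parameters(0.1, 0.02, 10.0) describes a system whose amplitude falls e times over N_e periods, with Q equal to π times that number.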

We established in Sec. 7.5 that the total energy of an oscillating 
system is proportional to the square of the amplitude [see Eq. (7.68)].
Accordingly, the energy of the system in damped oscillations diminishes
with time according to the law

E = E_0 e^{-2\beta t}    (7.107)

(E₀ is the value of the energy at t = 0). Time differentiation of this
expression gives the rate of growth of the system's energy:

\frac{dE}{dt} = -2\beta E_0 e^{-2\beta t} = -2\beta E

By reversing the signs, we find the rate of diminishing of the energy:

-\frac{dE}{dt} = 2\beta E    (7.108)

If the energy changes only slightly during the time equal to a period
of oscillations, the reduction of the energy during a period can be
found by multiplying Eq. (7.108) by T:

-\Delta E = 2\beta T E

(we remind our reader that ΔE stands for the increment, and −ΔE
for the decrement of the energy). Finally, taking into consideration
Eqs. (7.104) and (7.106), we
arrive at the relation

\frac{E}{(-\Delta E)} = \frac{Q}{2\pi}    (7.109)

from which it follows that upon 
slight damping of oscillations, 
the quality with an accuracy up 
to the factor 2π equals the ratio
of the energy stored in the sys- 
tem at a given moment to the 
decrement of this energy during 
one period of oscillations. 
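
How good the approximation behind Eq. (7.109) is can be judged from a small sketch. It compares the exact ratio E/(−ΔE) over one period, obtained from Eq. (7.107), with Q/2π. The function names and the sample values of β and T used with them are illustrative assumptions.

```python
import math


def energy_ratio_exact(beta, period):
    """E/(-dE) over one period, with E = E0*exp(-2*beta*t) taken exactly."""
    return 1.0 / (1.0 - math.exp(-2.0 * beta * period))


def energy_ratio_from_q(beta, period):
    """Q/(2*pi) with Q = pi/(beta*T), the right-hand side of Eq. (7.109)."""
    return (math.pi / (beta * period)) / (2.0 * math.pi)
```

With βT = 0.01 the two values differ by about one per cent, whereas with βT = 0.3 they differ by roughly a third, in line with the remark that the relation holds only for slight damping.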

It follows from Eq. (7.103)
that a growth in the damping
factor is attended by an increase
in the period of oscillations. At β = ω₀, the period of oscillations
becomes infinite, i.e. the motion stops being periodic.

Fig. 7.20

At β > ω₀, the roots of the characteristic equation become real
[see Eqs. (7.99)], and the solution of the differential equation (7.11)
is equal to the sum of two exponentials:

x = C_1 e^{\lambda_1 t} + C_2 e^{\lambda_2 t}

Here C₁ and C₂ are real constants whose values depend on the initial
conditions [on x₀ and v₀ = (ẋ)₀]. The motion is therefore aperiodic:
a system displaced from its equilibrium position returns to
it without performing oscillations. Figure 7.20 shows two possible
ways for a system to return to its equilibrium position in aperiodic
motion. How the system arrives at its equilibrium position depends
on the initial conditions. The motion depicted by curve 2 is obtained
when the system begins to move from the position characterized
by the displacement x₀ to its equilibrium position with the initial
velocity v₀ determined by the condition

|v_0| > |x_0| \left( \beta + \sqrt{\beta^2 - \omega_0^2} \right)    (7.110)

This condition will be obeyed when a system brought out of its 
equilibrium position is given a sufficiently strong impetus toward 
it. If after displacing a system from its equilibrium position we
release it without an impetus (i.e. with v₀ = 0) or impart to it an
impetus of insufficient force [such that v₀ is less than the value
determined by the condition (7.110)], the motion will occur according
to curve 1 in Fig. 7.20.
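
The two kinds of aperiodic return can be told apart with a short sketch. It finds C₁ and C₂ from the initial conditions for a solution of the form C₁exp(λ₁t) + C₂exp(λ₂t), with λ₁ and λ₂ the real roots (7.99), and tests whether x(t) passes through zero at some t > 0. This is compared with the condition (7.110), read here as applying to a velocity directed toward the equilibrium position. Function names and sample values are illustrative.

```python
import math


def overdamped_constants(x0, v0, beta, omega0):
    """C1, C2 and the real roots l1, l2 of Eq. (7.99); requires beta > omega0."""
    root = math.sqrt(beta ** 2 - omega0 ** 2)
    l1, l2 = -beta + root, -beta - root
    c1 = (v0 - l2 * x0) / (l1 - l2)   # from x(0) = x0 and x'(0) = v0
    c2 = (l1 * x0 - v0) / (l1 - l2)
    return c1, c2, l1, l2


def crosses_equilibrium(x0, v0, beta, omega0):
    """True if x(t) = C1*exp(l1*t) + C2*exp(l2*t) vanishes at some t > 0."""
    c1, c2, l1, l2 = overdamped_constants(x0, v0, beta, omega0)
    # x(t) = 0 means exp((l1 - l2)*t) = -C2/C1; l1 > l2, so t > 0 needs -C2/C1 > 1.
    return c1 != 0.0 and -c2 / c1 > 1.0


def condition_7_110(x0, v0, beta, omega0):
    """Condition (7.110) for an initial velocity directed toward equilibrium."""
    toward = x0 * v0 < 0.0
    return toward and abs(v0) > abs(x0) * (beta + math.sqrt(beta ** 2 - omega0 ** 2))
```

When the condition holds, the system overshoots the equilibrium position once before returning to it (curve 2); otherwise it approaches equilibrium from one side only (curve 1).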


7.11. Auto-Oscillations 

The energy of a system in damped oscillations is used to overcome
the resistance of the medium. If this decrease of energy is replenished,
the oscillations will become undamped. The energy of a system
can be replenished at the expense of impetuses from outside, but
they must be imparted to the system in step with its oscillations,
otherwise they may weaken the latter and even stop them. An oscillating
system can be made to control the external action itself,
ensuring agreement between the impetuses imparted to it and its
motion. Such a system is called an auto-oscillating one, and the
undamped oscillations it performs are called auto-oscillations.

Fig. 7.21

Let us consider a clock mechanism as an example of an auto-oscil- 
latory system. The clock pendulum is fitted onto the same axis as 
a bent lever — the anchor (Fig. 7.21). The ends of the anchor carry 
projections of a special shape called pallets. The toothed escape 
wheel is acted upon by a chain with a weight or a wound up spring 
that tends to turn it clockwise. During the major part of the time, 
however, one of the wheel’s teeth bears against the side face of 
a pallet, the latter sliding along the tooth’s surface when the pendu- 
lum oscillates. Only when the pendulum is near its middle position 
do the pallets stop being in the way of the teeth, and the escape 
wheel turns, pushing the anchor by means of a tooth whose tip 
slides along the chamfered end of a pallet. During a complete cycle 
of pendulum oscillations (during a period), the escape wheel turns
through two teeth, and each of the pallets receives a push. These
pushes, performed at the expense of the energy of a lifted weight or 
a wound up spring, are exactly what replenishes the decrease in 
the energy of the pendulum due to friction. 


7.12. Forced Oscillations 

When the driving force changes according to a harmonic law, the 
oscillations are described by the differential equation 

\ddot{x} + 2\beta\dot{x} + \omega_0^2 x = f_0 \cos\omega t    (7.111)

[see Eq. (7.13)]. Here β is the damping factor, ω₀ is the natural frequency
of the system [see Eqs. (7.6), (7.10)], f₀ = F₀/m (F₀ is the
amplitude of the driving force), and ω is the frequency of the force.

Equation (7.111) is a non-homogeneous one. According to the 
theorem (7.41), the general solution of a non-homogeneous equation 
equals the sum of the general solution of the corresponding homoge- 
neous equation and the partial solution of the non-homogeneous 
one. We already know the general solution of a homogeneous equation
[see the function (7.102), which is the general solution of
Eq. (7.11)]. It has the form

x = A_0 e^{-\beta t} \cos(\omega' t + \alpha)    (7.112)

where ω' = √(ω₀² − β²), and A₀ and α are arbitrary constants*.

It remains for us to find the partial (containing no arbitrary con- 
stants) solution of Eq. (7.111). We shall use the procedure described 
at the end of Sec. 7.4 for this purpose. Let us add the imaginary function
i f₀ sin ωt to the function in the right-hand side of Eq. (7.111).
After this, we can write the right-hand side in the form f₀ exp(iωt)
[see Eq. (7.24)]. We thus arrive at the equation

\ddot{\hat{x}} + 2\beta\dot{\hat{x}} + \omega_0^2 \hat{x} = f_0 e^{i\omega t}    (7.113)

It is easier to solve this equation than Eq. (7.111) because it is sim- 
pler to differentiate and integrate an exponent than trigonometric 
functions. 

We shall try to find the partial solution of Eq. (7.113) in the form 

\hat{x} = \hat{A} e^{i\omega t}    (7.114)

where Â is a complex number. The function (7.114) is also complex,
which has been indicated by capping the x. Time differentiation of
this function yields

\dot{\hat{x}} = i\omega \hat{A} e^{i\omega t}, \quad \ddot{\hat{x}} = -\omega^2 \hat{A} e^{i\omega t}    (7.115)

* The symbol ω without a prime stands for the frequency of the driving
force.


Introduction of Eqs. (7.114) and (7.115) into Eq. (7.113) and cancelling
of the common factor exp(iωt) give the algebraic equation

-\omega^2 \hat{A} + 2i\beta\omega \hat{A} + \omega_0^2 \hat{A} = f_0

Hence,

\hat{A} = \frac{f_0}{(\omega_0^2 - \omega^2) + 2i\beta\omega}    (7.116)


We have found the value of Â at which the function (7.114) satisfies
Eq. (7.113). Let us write the complex number in the denominator
in the exponential form:

(\omega_0^2 - \omega^2) + 2i\beta\omega = \rho e^{i\varphi}    (7.117)

By Eqs. (7.18), we have

\rho = \sqrt{(\omega_0^2 - \omega^2)^2 + 4\beta^2\omega^2}, \quad \varphi = \arctan\frac{2\beta\omega}{\omega_0^2 - \omega^2}    (7.118)

Substitution of the denominator in Eq. (7.116) in accordance
with Eq. (7.117) yields

\hat{A} = \frac{f_0}{\rho e^{i\varphi}} = \frac{f_0}{\rho} e^{-i\varphi}

Introduction of this value of Â into (7.114) gives the partial solution
of Eq. (7.113):

\hat{x} = \frac{f_0}{\rho} e^{-i\varphi} e^{i\omega t} = \frac{f_0}{\rho} e^{i(\omega t - \varphi)}

Finally, by taking the real part of this function, we get the partial
solution of Eq. (7.111):

x = \frac{f_0}{\rho} \cos(\omega t - \varphi)


Introduction of the values of f₀, and also of the values of ρ and φ
from Eqs. (7.118), gives the final expression

x = \frac{F_0/m}{\sqrt{(\omega_0^2 - \omega^2)^2 + 4\beta^2\omega^2}} \cos\left( \omega t - \arctan\frac{2\beta\omega}{\omega_0^2 - \omega^2} \right)    (7.119)


We must note that the function (7.119) contains no arbitrary con- 
stants. 
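
The complex-amplitude route to (7.119) lends itself to a direct numerical check. The sketch below computes Â from Eq. (7.116), compares its modulus and argument with ρ and φ of Eqs. (7.118), and substitutes x = (f₀/ρ) cos(ωt − φ) back into Eq. (7.111). The two-argument arctangent is used so that φ falls between 0 and π when ω > ω₀ as well; this choice and the function names are assumptions of the sketch.

```python
import cmath
import math


def complex_amplitude(f0, omega0, beta, omega):
    """Eq. (7.116): complex amplitude of the forced oscillation."""
    return f0 / complex(omega0 ** 2 - omega ** 2, 2.0 * beta * omega)


def amplitude_and_lag(f0, omega0, beta, omega):
    """Real amplitude f0/rho and phase lag phi from Eqs. (7.118)."""
    rho = math.hypot(omega0 ** 2 - omega ** 2, 2.0 * beta * omega)
    phi = math.atan2(2.0 * beta * omega, omega0 ** 2 - omega ** 2)
    return f0 / rho, phi


def steady_state_residual(f0, omega0, beta, omega, t):
    """Left side minus right side of Eq. (7.111) for x = A*cos(omega*t - phi)."""
    a, phi = amplitude_and_lag(f0, omega0, beta, omega)
    x = a * math.cos(omega * t - phi)
    v = -a * omega * math.sin(omega * t - phi)
    acc = -a * omega ** 2 * math.cos(omega * t - phi)
    return acc + 2.0 * beta * v + omega0 ** 2 * x - f0 * math.cos(omega * t)
```

The residual vanishes to rounding error below, at, and above the natural frequency, and the argument of Â equals −φ, i.e. the oscillation lags behind the force.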

Let us obtain a partial solution of Eq. (7.111) in another way — 
with the aid of a vector diagram. We shall assume that the partial 
solution of Eq. (7.111) has the form 

x = A \cos(\omega t - \varphi)    (7.120)

Hence,

\dot{x} = -\omega A \sin(\omega t - \varphi) = \omega A \cos\left( \omega t - \varphi + \frac{\pi}{2} \right)    (7.121)

\ddot{x} = -\omega^2 A \cos(\omega t - \varphi) = \omega^2 A \cos(\omega t - \varphi + \pi)    (7.122)

The use of Eqs. (7.120)-(7.122) in Eq. (7.111) yields

\omega^2 A \cos(\omega t - \varphi + \pi) + 2\beta\omega A \cos\left( \omega t - \varphi + \frac{\pi}{2} \right) + \omega_0^2 A \cos(\omega t - \varphi) = f_0 \cos\omega t    (7.123)

It follows from Eq. (7.123) that the constants A and φ must have
values such that the harmonic function f₀ cos ωt will equal the sum
of the three harmonic functions in the left-hand side of the equation.
If we depict the function ω₀²A cos(ωt − φ) by a vector of length
ω₀²A directed to the right (Fig. 7.22), then the function
2βωA cos(ωt − φ + π/2) will be depicted by a vector of length 2βωA
turned counterclockwise relative to the vector ω₀²A through the
angle π/2 (see Sec. 7.7), and the function ω²A cos(ωt − φ + π)
by a vector of length ω²A turned through the angle π relative to the
vector ω₀²A. For Eq. (7.123) to be satisfied, the sum of the three enumerated
vectors must coincide with the vector depicting the function
f₀ cos ωt. Inspection of Fig. 7.22a shows that such coincidence is
possible only at a value of the amplitude A determined by the condition

(\omega_0^2 - \omega^2)^2 A^2 + 4\beta^2\omega^2 A^2 = f_0^2

Fig. 7.22

whence

A = \frac{F_0/m}{\sqrt{(\omega_0^2 - \omega^2)^2 + 4\beta^2\omega^2}}    (7.124)

(we have replaced f₀ with the ratio F₀/m). Figure 7.22a corresponds
to the case when ω < ω₀. We get the same value of A from Fig. 7.22b
corresponding to the case when ω > ω₀.

Figure 7.22 also allows us to obtain the value of φ showing the
lagging in phase of the forced oscillation (7.120) behind the driving
force producing it. It can be seen from the figure that

\tan\varphi = \frac{2\beta\omega}{\omega_0^2 - \omega^2}    (7.125)

Using the values of A and φ determined by Eqs. (7.124) and (7.125)
in Eq. (7.120), we get the function (7.119).





Function (7.119) when added to Eq. (7.112) gives the general 
solution of Eq. (7.111) describing the behaviour of the system upon 
forced oscillations. Addend (7.112) plays an appreciable part only 
in the initial stage of the process, during the so-called setting in of 
the oscillations (Fig. 7.23). With the passage of time, owing to the 
exponential factor exp(−βt), the part played by addend (7.112)
diminishes to a greater and greater extent, and after sufficient time 
elapses it may be disregarded, retaining only addend (7.119) in 
the solution. 

Thus, function (7.119) describes steady-state forced oscillations. 
They are harmonic oscillations with a frequency equal to that of 
the driving force. The amplitude (7.124) of the forced oscillations is 
proportional to that of the driving force. For a given
oscillatory system (determined by ω₀ and β), the amplitude depends on the
frequency of the driving force. Forced oscillations lag in phase behind
their driving force; the lagging φ also depends on the frequency of
the force [see Eq. (7.125)].

As a result of the amplitude of forced oscillations depending on 
the frequency of the driving force, the amplitude of the oscillations 
reaches a maximum value at a definite frequency for the given system.
The oscillatory system responds especially strongly to the action of
the driving force at this frequency. This phenomenon is called
resonance, and the corresponding frequency is called the resonance
frequency.

To determine the resonance frequency ω_res, we must find the maximum
of the function (7.124) or, which is the same, the minimum of
the expression inside the radical in the denominator.
Differentiating this expression with respect to ω and
equating the derivative to zero, we get the condition determining ω_res:

-4(\omega_0^2 - \omega^2)\omega + 8\beta^2\omega = 0    (7.126)

Equation (7.126) has three solutions: ω = 0 and
ω = ±√(ω₀² − 2β²). The solution equal to zero corresponds
to a maximum of the
denominator. Of the remaining two solutions, the negative one
must be discarded as being deprived of a physical meaning (the
frequency cannot be negative). We thus get a single value for the
resonance frequency:

\omega_{res} = \sqrt{\omega_0^2 - 2\beta^2}    (7.127)



Using this value of the frequency in Eq. (7.124), we get an expres- 
sion for the amplitude in resonance: 


A_{res} = \frac{F_0/m}{2\beta\sqrt{\omega_0^2 - \beta^2}}    (7.128)

It follows from Eq. (7.128) that the amplitude in resonance would
become equal to infinity in the absence of resistance of the medium.
By Eq. (7.127), the resonance frequency in such conditions (at β = 0)
coincides with the natural frequency of oscillations of the system ω₀.
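
Equations (7.124), (7.127) and (7.128) can be tied together in a brief sketch: the amplitude evaluated at the resonance frequency should reproduce the resonance amplitude, and nearby frequencies should give smaller values. The function names and the sample numbers are illustrative; f0 stands for F₀/m.

```python
import math


def forced_amplitude(f0, omega0, beta, omega):
    """Eq. (7.124) with f0 = F0/m."""
    return f0 / math.sqrt((omega0 ** 2 - omega ** 2) ** 2
                          + 4.0 * beta ** 2 * omega ** 2)


def resonance_frequency(omega0, beta):
    """Eq. (7.127); returns None when 2*beta**2 >= omega0**2 (no resonance)."""
    radicand = omega0 ** 2 - 2.0 * beta ** 2
    if radicand <= 0.0:
        return None
    return math.sqrt(radicand)


def resonance_amplitude(f0, omega0, beta):
    """Eq. (7.128) with f0 = F0/m; requires 0 < beta < omega0."""
    return f0 / (2.0 * beta * math.sqrt(omega0 ** 2 - beta ** 2))
```

Evaluating forced_amplitude over a range of frequencies for several values of β gives a family of curves of amplitude against driving frequency.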

The dependence of the amplitude of forced oscillations on the 
frequency of the driving force (or, which is the same, on the fre- 
quency of oscillations) is shown graphically in Fig. 7.24. The sepa- 
rate curves correspond to different values of the parameter p. Accord- 
ing to Eqs. (7.127) and (7.128), the peak of a given curve is higher 
and further to the right with decreasing values of p. The expression 
for the resonance frequency becomes imaginary upon very great 
damping (such that 2p 2 > a> 2 ). This signifies that no resonance is 
observed in these conditions — the amplitude of forced oscillations 
monotonously diminishes with increasing frequency (see the lower 
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curve in Fig. 7.24). The curves shown in Fig. 7.24 corresponding to 
different values of the parameter p are called resonance curves. 

We can add the following remarks with respect to resonance curves.
When $\omega$ tends to zero, all the curves arrive at the same limiting
value, differing from zero, equal to $F_0/m\omega_0^2$, i.e. $F_0/k$.
This value is the displacement from the equilibrium position received
by the system under the action of a constant force of magnitude $F_0$.
When $\omega$ tends to infinity, all the curves asymptotically tend to
zero because at a high frequency the force changes its direction so
rapidly that the system does not manage to become displaced from its
equilibrium position. Finally, we must note that a decrease in $\beta$
is attended by a greater change in the amplitude with the frequency
near resonance and by a sharper “peak”.

Fig. 7.25

It follows from Eq. (7.128) that with small damping (i.e. when
$\beta \ll \omega_0$), the amplitude in resonance is

$$a_{\mathrm{res}} \approx \frac{F_0/m}{2\beta\omega_0}$$

Let us divide this expression by the displacement $x_0$ from the
equilibrium position under the action of the constant force $F_0$,
equal to $F_0/m\omega_0^2$. The result is

$$\frac{a_{\mathrm{res}}}{x_0} \approx \frac{\omega_0}{2\beta} = \frac{2\pi}{2\beta T} = \frac{\pi}{\lambda} = Q \tag{7.129}$$

[see Eq. (7.106)]. Thus, the quality Q shows how many times the
amplitude at the moment of resonance exceeds the displacement of
the system from its equilibrium position under the action of a
constant force of the same magnitude as the amplitude of the driving
force (this holds only with slight damping).
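How slight the damping must be for Eq. (7.129) to hold can be seen by
comparing the exact ratio $a_{\mathrm{res}}/x_0$ that follows from
Eq. (7.128) with the estimate $Q = \omega_0/2\beta$. The Python sketch
below does this for a few values of $\beta$; the function names and the
numbers are illustrative assumptions.

```python
import math

def amplitude_ratio_exact(omega0, beta):
    """a_res / x0 from Eq. (7.128), with x0 = F0 / (m * omega0**2)."""
    return omega0**2 / (2 * beta * math.sqrt(omega0**2 - beta**2))

def quality(omega0, beta):
    """Small-damping estimate Q = omega0 / (2 * beta), Eq. (7.129)."""
    return omega0 / (2 * beta)

omega0 = 10.0  # illustrative natural frequency
for beta in (0.01, 0.1, 1.0, 3.0):
    exact = amplitude_ratio_exact(omega0, beta)
    q = quality(omega0, beta)
    print(f"beta={beta}: exact={exact:.4f}  Q={q:.4f}  "
          f"relative difference={(exact - q) / exact:.2e}")
```

The relative difference grows roughly as $\beta^2/2\omega_0^2$, so the
estimate is good only while $\beta \ll \omega_0$.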

Inspection of Fig. 7.22 shows that forced oscillations lag in phase
behind their driving force; this lag ranges from 0 to $\pi$. The
dependence of $\varphi$ on $\omega$ at various values of $\beta$ is shown
in Fig. 7.25. The value $\varphi = \pi/2$ corresponds to the frequency
$\omega_0$. The resonance frequency is lower than the natural one
[see Eq. (7.127)]. Hence, at the moment of resonance, $\varphi < \pi/2$.
When the damping is insignificant, $\omega_{\mathrm{res}} \approx \omega_0$,
and we may assume that in resonance $\varphi = \pi/2$.

The phenomenon of resonance must never be forgotten in designing 
machines and various structures. The natural frequency of oscilla- 
tions of such equipment and facilities must not be close to the fre- 
quency of possible external actions. For example, the natural fre- 
quency of vibrations of a ship’s hull or an aeroplane’s wings must 
greatly differ from the frequency of the vibrations that might be 
produced by rotation of the propeller. Otherwise vibrations will 
appear that may cause a catastrophe. Cases are known when bridges
collapsed owing to the marching of columns of soldiers over them.
The reason was that the natural frequency of oscillations of the
bridge was close to the frequency of the soldiers’ steps.

The phenomenon of resonance, at the same 
time, is often very useful, especially in acoustics, 
radio engineering, etc. 


7.13. Parametric Resonance 

In the case dealt with in the preceding section, a driving force
applied from outside produced a direct displacement of a system from
its equilibrium position. Another kind of external action is known to
exist by means of which great oscillations can be imparted to a
system. This kind of action consists in periodically changing a
parameter of the system in step with its oscillations, owing to
which the phenomenon is called parametric resonance.

Fig. 7.26

Let us take as an example a simple pendulum — a ball on a thread. 
If we periodically change the length l of the pendulum, increasing 
it when the pendulum is at its extreme positions and decreasing 
it when the pendulum is at its middle position (Fig. 7.26), then the 
pendulum starts swinging violently. The energy of the pendulum here 
grows at the expense of the work done by the force acting on the 
thread. The force tensioning the thread is not constant when the 
pendulum oscillates: it is smaller at the extreme positions when the 
velocity vanishes, and is greater at the middle position when the 
velocity of the pendulum is maximum. Consequently, the negative 
work of the external force upon elongation of the pendulum is smaller 
in magnitude than the positive work done upon shortening of the 
pendulum. As a result, the work done by the external force during 
a period is greater than zero. 



CHAPTER 8 RELATIVISTIC MECHANICS


8.1. The Special Theory of Relativity 

It was noted in Sec. 2.1 that Newtonian mechanics holds only 
for bodies travelling with speeds that are much lower than the speed 
of light in a vacuum (this speed is denoted by the symbol c). To 
describe motion at speeds comparable with c , Albert Einstein ad- 
vanced relativistic mechanics, i.e. mechanics taking the requirements 
of the special theory of relativity into account. 

The special theory of relativity presented by Einstein in 1905 is 
a physical theory of space and time*. The foundation of this theory 
is formed by two postulates called Einstein’s principle of relativity 
and the principle of constancy of the speed of light. 

Einstein’s principle of relativity is an extension of Galileo’s
mechanical principle (see Sec. 2.7) to all physical phenomena without
any exception. According to this principle, all laws of nature
are the same in all inertial reference frames. The unchanged form
of an equation when the coordinates and time of one reference frame
are replaced in it with the coordinates and time of another frame is
called the invariance of the equation. The principle of relativity
can therefore be formulated as follows: the equations expressing the
laws of nature are invariant with respect to transformations of
coordinates and time from one inertial reference frame to another.

The principle of constancy of the speed of light states that the
speed of light in a vacuum is the same in all inertial reference frames
and does not depend on the motion of the sources and receivers of light.**

A number of important conclusions relating to the properties of 
space and time follow from the above postulates. Space and time 
were considered independently of each other in Newtonian mechan- 
ics. Newton considered that absolute space and absolute time exist. 
He defined absolute space as a container of articles that always 
remains the same and stationary and that has no relation to anything 
external. Newton wrote about time that absolute, true, or mathe- 
matical time flows uniformly without any relation to anything 
external by itself and owing to its internal nature. Accordingly, 
it was considered absolutely obvious that two events occurring
simultaneously in one reference frame will also be simultaneous in
all other reference frames. It is easy to see, however, that the latter
statement contradicts the principle of the constancy of the speed
of light.

* In 1915, Einstein presented the fundamentals of the general theory of
relativity, which is the theory of gravitation.

** The experiment performed by A. Michelson and E. Morley confirming the
validity of this principle will be described in the second volume of our course.

Let us take two bodies K and K' forming inertial reference frames
together with their corresponding clocks. Assume that body K'
moves relative to body K with the velocity $\mathbf{v}_0$ directed along
the straight line passing through the centres of the bodies (Fig. 8.1).
Let us put two bodies M and N on this line. The bodies are at equal
distances from body K' and are rigidly joined to it. These bodies
move relative to body K with the velocity $\mathbf{v}_0$, and are at rest
relative to body K'. Let us consider the same process in both frames,
namely, the emission of a light signal from the centre of body K'
and its reaching bodies M and N. The speed of light in all directions
is the same and equals c. Hence, in the reference frame K', the signal
will reach bodies M and N at the same moment t'.

In the reference frame K, light also propagates in all directions
with the speed c. In this frame, M moves toward the light signal.
Body N moves in the same direction as the signal. Consequently, the
signal reaches M before it reaches N, and therefore $t_M < t_N$. Thus,
the events that were simultaneous in the frame K' will not be
simultaneous in the frame K. Hence it follows that time flows
differently in different reference frames.
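The reasoning in the frame K can be put into numbers. The Python sketch
below assumes that at the moment of emission the bodies M and N are
each at a distance d, measured in K, from the point of emission; the
quantity d, the function name and the sample speeds are hypothetical
and serve only as an illustration.

```python
C = 299_792_458.0  # speed of light in a vacuum, m/s

def arrival_times_in_K(d, v0):
    """Moments (on the clocks of K) at which a flash emitted at t = 0
    reaches M, which moves toward the signal, and N, which moves away.

    d  -- distance in K from the emission point to M and to N at t = 0
    v0 -- speed of K' (and hence of M and N) relative to K
    """
    t_M = d / (C + v0)  # the gap between the light and M shrinks at c + v0
    t_N = d / (C - v0)  # the gap between the light and N shrinks at c - v0
    return t_M, t_N

for fraction in (0.0, 0.1, 0.5, 0.9):
    t_M, t_N = arrival_times_in_K(1000.0, fraction * C)
    print(f"v0 = {fraction} c: t_M = {t_M:.3e} s, t_N = {t_N:.3e} s")
```

Only for $v_0 = 0$ do the two moments coincide; for any other speed the
signal reaches M first.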

To describe an event in a reference frame, we must indicate the 
place and the moment at which it occurs. This task can be coped 
with if we set up equally spaced coordinate marks in space and put 
a clock at each mark that will permit us to determine the moment 
at which the event occurs at the given place. The coordinate marks 
can be made by transferring a unit scale. Any system performing a
periodically repeating process can be used as a clock. To compare the
moments at which two events occur at different points of space, we 
must see that the clocks at these points are synchronized. 

It would seem possible to synchronize the clocks by first placing 
them next to one another, and then, after comparing their readings, 
by transferring them to the corresponding points of space. Such 
a method must be rejected, however, because we do not know how 
the transfer of the clocks from one place to another will affect their 
running. We must therefore first put the clocks at their relevant
places and only then compare their readings. This can be done by
sending a light signal from one clock to the others*. Assume that a
light signal is sent from point A at the moment $t_1$ (according to the
clock at A). The signal is reflected from a mirror at point B and
returns to A at the moment $t_2$. The clock at B should be considered
synchronized with the one at A if at the moment when the signal
reaches it the clock at B shows the time t equal to $(t_1 + t_2)/2$.
This procedure must be performed for all the clocks arranged at the
different points of the frame K. The events at A and B will be
considered simultaneous in the frame K if the readings of the clocks
at A and B corresponding to them coincide.
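The synchronization rule can be written as a short Python sketch. The
function names and the sample readings are hypothetical; the only
relation taken from the text is that the clock at B must show
$(t_1 + t_2)/2$ at the moment of reflection.

```python
def synchronized_reading(t1, t2):
    """Reading the clock at B must show when the signal reaches it,
    given the emission moment t1 and return moment t2 on the clock at A."""
    return (t1 + t2) / 2.0

def correction_for_clock_B(t1, t2, reading_at_B):
    """Amount to be added to the clock at B to synchronize it with A."""
    return synchronized_reading(t1, t2) - reading_at_B

# Illustrative readings in seconds.
t1, t2 = 10.0, 10.000002
print(synchronized_reading(t1, t2))
print(correction_for_clock_B(t1, t2, 10.0000015))
```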

All the clocks in the frame K' and in any other inertial reference 
frame are synchronized in a similar way. The speed of the light 
signal used for synchronization is the same in all the inertial refer- 
ence frames. This explains why it is a light signal that is chosen 
as the signal for clock synchronization. The speed of light was found 
to be the limit. No signal, no action of one body on another can 
propagate with a speed exceeding that of light in a vacuum. This is 
the reason for light having the same speed in a vacuum in all refer- 
ence frames. According to the principle of relativity, the laws of 
nature in all inertial frames must be identical. The circumstance 
that the speed of a signal cannot exceed a limiting value is also a 
law of nature. Hence, the value of the limiting speed must be the 
same in all reference frames. 

The constancy of the speed of light results in space and time being
mutually related, forming a single space-time. This relation can
be depicted especially clearly with the aid of an imaginary
four-dimensional space along three axes of which the space coordinates
x, y, z are laid off, and along the fourth axis the time t, more exactly
the time coordinate ct proportional to t and having the same dimension
as the space coordinates.

An event (for instance, the decay of a particle) is characterized
by the place where it occurred (by the coordinates x, y, z) and by
the time t when it occurred. Thus, a point with the coordinates
x, y, z, ct corresponds to an event in our imaginary four-dimensional
space. This point is called the world point. A line called the world
line corresponds to any particle (even a stationary one) in
four-dimensional space (for a particle at rest it has the form of a
straight line parallel to the ct-axis).

* The checking of clocks according to radio signals is in essence such
synchronization.

Thus, space and time are parts of a single whole. But time differs
qualitatively from space. This manifests itself in that our imaginary
four-dimensional space differs in its properties from the conventional
three-dimensional space. The latter has Euclidean metric. This
signifies that the square of the distance $\Delta l$ between two points
equals the sum of the squares of the coordinate differences:

$$\Delta l^2 = \Delta x^2 + \Delta y^2 + \Delta z^2$$

The square of the “distance” between two world points (this distance
is called an interval and is designated by the symbol $\Delta s$) is
determined by the equation

$$\Delta s^2 = c^2\Delta t^2 - \Delta x^2 - \Delta y^2 - \Delta z^2 \tag{8.1}$$

(the properties of an interval are treated in Sec. 8.4).

Spaces for which the square of the distance is determined by a 
formula such as (8.1) are called pseudo-Euclidean. The qualitative 
distinction between time and space manifests itself in that the square 
of the time coordinate and the squares of the space coordinates enter 
Eq. (8.1) with different signs. 

A distinctive part in the special theory of relativity is played 
by quantities that are invariant with respect to the transformations 
of the coordinates and time from one inertial reference frame to 
another (in other words, quantities having the same numerical value 
in all inertial reference frames). We know one such quantity, namely, 
the speed of light in a vacuum. We shall show in Sec. 8.4 that the 
interval defined by Eq. (8.1) is also an invariant. 

A distinctive part is also played by equations and relations that 
are invariant with respect to the transformations indicated above 
(i.e. having the same form in all inertial reference frames). For exam- 
ple, the relativistic expressions for the momentum and energy are 
determined so that the laws of conservation of these quantities are 
not violated when transferring to another inertial reference frame. 
We shall acquaint ourselves with a number of invariant quantities 
and relations in our further treatment. 




8.2. Lorentz Transformations 

Let us consider two inertial reference frames K and K' (Fig. 8.2).
Assume that the frame K' moves relative to the frame K with the
velocity $\mathbf{v}_0$*. Let us direct the axes x and x' along the
vector $\mathbf{v}_0$, and assume that the axes y and z are parallel to
the axes y' and z', respectively.

* We remind our reader that the name inertial is used to designate a
reference frame relative to which a free particle moves without
acceleration (see Sec. 2.2). In Sec. 2.7, we showed on the basis of the
Galilean transformation that the frame K' moving relative to the
inertial frame K with the constant velocity $\mathbf{v}_0$ is also
inertial, in turn. In relativistic mechanics, the Galilean
transformations have to be replaced with other ones that agree with the
principle of the constancy of the speed of light. It is clear, however,
that no matter what the law of transformation is when passing from the
frame K to the frame K' moving relative to it with the constant
velocity $\mathbf{v}_0$, if the velocity $\mathbf{v}$ of a particle in
the frame K is constant, then its velocity $\mathbf{v}'$ in the frame K'
will also be constant. Consequently, in relativistic mechanics too, the
frame K' moving with a constant velocity $\mathbf{v}_0$ relative to the
inertial frame K will also be inertial.

Owing to the principle of relativity, the frames K and K' have
absolutely equal rights. Their only formal distinction is that the
x-coordinate of origin O' of the frame K' measured in the frame K
changes according to the law

$$x_{O'} = v_0 t \tag{8.2}$$

whereas the x'-coordinate of origin O of the frame K measured in
the frame K' changes according to the law

$$x'_{O} = -v_0 t' \tag{8.3}$$

This distinction is due to the fact that we have chosen identical
directions of the axes x and x', but the frames K and K' move in
opposite directions relative to each other. Hence, the projection of
the velocity of the frame K' relative to K onto the x-axis is $v_0$,
and that of the velocity of the frame K relative to K' onto the
x'-axis is $-v_0$.

In non-relativistic mechanics, we used the Galilean transformation
(2.19) to pass over from the coordinates and time of one inertial
reference frame to the coordinates and time of another inertial frame.
The rule of velocity addition $\mathbf{v} = \mathbf{v}' + \mathbf{v}_0$
[see Eq. (2.21)] follows from these transformations. This rule
contradicts the principle of constancy of the speed of light. Indeed,
if in the frame K' a light signal propagates in the direction of the
vector $\mathbf{v}_0$ with the velocity c, then according to Eq. (2.21)
in the frame K the velocity of the signal will be $c + v_0$, i.e. it
will exceed c. Hence, it follows that the Galilean transformations must
be replaced with other formulas. It is not difficult to find the latter.

In the most general form, the transformations of the coordinates
and time from the frame K' to the frame K are as follows:

$$x = f_1(x', y', z', t'), \quad y = f_2(x', y', z', t'), \quad z = f_3(x', y', z', t'), \quad t = f_4(x', y', z', t') \tag{8.4}$$

It follows from the uniformity of time and space that Eqs. (8.4)
should be linear, i.e. have the form

$$x = \alpha_1 x' + \alpha_2 y' + \alpha_3 z' + \alpha_4 t' + \alpha_5 \tag{8.5}$$

and so on, where $\alpha_1, \alpha_2, \ldots$ are constants. Accordingly,

$$dx = \alpha_1\,dx' + \alpha_2\,dy' + \alpha_3\,dz' + \alpha_4\,dt' \tag{8.6}$$

and so on.

Indeed, according to Eqs. (8.4),

$$dx = \frac{\partial f_1}{\partial x'}\,dx' + \frac{\partial f_1}{\partial y'}\,dy' + \frac{\partial f_1}{\partial z'}\,dz' + \frac{\partial f_1}{\partial t'}\,dt' \tag{8.7}$$

and so on. If we take the arbitrarily chosen values dx', dy', dz', dt'
for the point $x'_1, y'_1, z'_1, t'_1$, then upon introducing into
Eqs. (8.7) the values of the derivatives at the given point, we get a
certain value $dx_1$ for dx. Owing to the uniformity of space and time,
however, for any other point $x'_2, y'_2, z'_2, t'_2$ at the same values
dx', dy', dz', dt' we should get the same value for dx as for the first
point, i.e. we should have $dx_2 = dx_1$. The same should hold for dy,
dz, and dt. Since dx', dy', dz', dt' were chosen absolutely arbitrarily,
this requirement can be observed only if the derivatives
$\partial f_1/\partial x'$, etc. do not depend on the coordinates, i.e.
are constants. Hence follows Eq. (8.6), and then also Eq. (8.5).

With the choice of the coordinate axes shown in Fig. 8.2, the
plane y = 0 coincides with the plane y' = 0, and the plane z = 0
with the plane z' = 0. It thus follows that, for example, the
coordinates y and y' must become equal to zero simultaneously
regardless of the values of the other coordinates and time. Therefore,
y and y' can be related only by expressions of the kind

$$y = \varepsilon y'$$

where $\varepsilon$ is a constant. Owing to the frames K and K' having
equal rights, the reverse relation must hold, i.e.

$$y' = \varepsilon y$$

with the same value of the constant $\varepsilon$ as in the first case.
Multiplication of these two equations yields $\varepsilon^2 = 1$, whence
$\varepsilon = \pm 1$. The plus sign corresponds to the axes y and y'
having the same directions, and the minus sign to their having opposite
directions. Giving the axes the same direction, we get

$$y = y' \tag{8.8}$$

Similar reasoning yields

$$z = z' \tag{8.9}$$

Now let us turn to finding the transformations for x and t. It can
be seen from Eqs. (8.8) and (8.9) that the values of y and z do not
depend on x' and t'. Hence, the values of x' and t' cannot depend on y
and z; correspondingly, the values of x and t cannot depend on y'
and z'. Thus, x and t can be linear functions of only x' and t'.

The origin of coordinates O of the frame K has the coordinate
x = 0 in the frame K and $x' = -v_0 t'$ in the frame K' [see Eq. (8.3)].
Consequently, the expression $(x' + v_0 t')$ must vanish simultaneously
with the coordinate x. For this to occur, the linear transformation
should have the form

$$x = \gamma(x' + v_0 t') \tag{8.10}$$

where $\gamma$ is a constant.

Similarly, the origin of coordinates O' of the frame K' has the
coordinate x' = 0 in the frame K' and $x = v_0 t$ in the frame K [see
Eq. (8.2)]. Hence,

$$x' = \gamma(x - v_0 t) \tag{8.11}$$

It follows from the frames K and K' having equal rights that the
constant of proportionality in both cases should be the same.

We shall use the principle of constancy of the speed of light to
find the constant $\gamma$. Let us begin to count the time in both
frames from the moment when their origins of coordinates coincide.
Assume that at the moment t = t' = 0 a light signal is sent in the
direction of the axes x and x' that causes a flash of light to appear
on a screen at a point with the coordinate x in the frame K and with
the coordinate x' in the frame K'. This event (flash) is described by
the coordinate x and the moment t in the frame K, and by the
coordinate x' and the moment t' in the frame K', and

$$x = ct, \qquad x' = ct'$$

Using these values of x and x' in Eqs. (8.10) and (8.11), we get

$$ct = \gamma(ct' + v_0 t') = \gamma(c + v_0)\,t'$$
$$ct' = \gamma(ct - v_0 t) = \gamma(c - v_0)\,t$$

Multiplication of these two equations yields

$$c^2 = \gamma^2(c^2 - v_0^2)$$


Hence,

$$\gamma = \frac{1}{\sqrt{1 - v_0^2/c^2}} \tag{8.12}$$

Introduction of this value into Eq. (8.10) gives

$$x = \frac{x' + v_0 t'}{\sqrt{1 - v_0^2/c^2}} \tag{8.13}$$

Equation (8.13) allows us to find the value of x according to known
values of x' and t'. To obtain an equation allowing us to find the
value of t according to the known values of x' and t', let us eliminate
the coordinate x from Eqs. (8.10) and (8.11) and solve the resulting
expression for t. We obtain

$$t = \gamma\left[t' + \frac{x'}{v_0}\left(1 - \frac{1}{\gamma^2}\right)\right]$$

Substituting for $\gamma$ its value from Eq. (8.12), we have

$$t = \frac{t' + (v_0/c^2)\,x'}{\sqrt{1 - v_0^2/c^2}} \tag{8.14}$$

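The combination of Eqs. (8.8), (8.9), (8.13), and (8.14) is called
the Lorentz transformations. If we use the generally adopted symbol

$$\beta = \frac{v_0}{c} \tag{8.15}$$

then the Lorentz transformations acquire the form

$$x = \frac{x' + \beta ct'}{\sqrt{1 - \beta^2}}, \quad y = y', \quad z = z', \quad t = \frac{t' + (\beta/c)\,x'}{\sqrt{1 - \beta^2}} \tag{8.16}$$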

Equations (8.16) allow us to pass over from coordinates and time
measured in the frame K' to those measured in the frame K (in short,
to pass over from the frame K' to the frame K). If we solve Eqs. (8.16)
relative to the primed quantities, we get the equations for
transformation from the frame K to K':

$$x' = \frac{x - \beta ct}{\sqrt{1 - \beta^2}}, \quad y' = y, \quad z' = z, \quad t' = \frac{t - (\beta/c)\,x}{\sqrt{1 - \beta^2}} \tag{8.17}$$


As should be expected with a view to the equal rights of the frames
K and K', Eqs. (8.17) differ from their counterparts (8.16) only in
the sign of $\beta$, i.e. of $v_0$.

It is easy to understand that when $v_0 \ll c$ (i.e. $\beta \ll 1$), the
Lorentz transformations become the same as the Galilean ones [see
Eqs. (2.19)]. The latter thus retain their importance for speeds that
are small in comparison with the speed of light in a vacuum.
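Both remarks, that Eqs. (8.17) are Eqs. (8.16) with the sign of $v_0$
reversed and that the Galilean formulas are recovered at low speeds,
can be illustrated with a short Python sketch. It assumes that the
Galilean transformation (2.19), not reproduced here, reads
$x = x' + v_0 t'$, $t = t'$; the function names and the sample event are
hypothetical.

```python
import math

C = 299_792_458.0  # speed of light in a vacuum, m/s

def to_K(x_p, t_p, v0):
    """Eqs. (8.16) for x and t: from K' to K (y and z are unchanged)."""
    beta = v0 / C
    root = math.sqrt(1.0 - beta**2)
    return (x_p + beta * C * t_p) / root, (t_p + (beta / C) * x_p) / root

def to_K_prime(x, t, v0):
    """Eqs. (8.17): the same formulas with the sign of v0 reversed."""
    return to_K(x, t, -v0)

def galilean_to_K(x_p, t_p, v0):
    """Galilean transformation in the form assumed above."""
    return x_p + v0 * t_p, t_p

x_p, t_p = 1000.0, 1.0  # sample event in K': 1 km from the origin, at 1 s

# Round trip K' -> K -> K' at a relativistic speed.
x, t = to_K(x_p, t_p, 0.8 * C)
print(to_K_prime(x, t, 0.8 * C))

# Low-speed comparison: v0 = 30 km/s, i.e. beta of the order of 1e-4.
v0 = 3.0e4
print(to_K(x_p, t_p, v0))
print(galilean_to_K(x_p, t_p, v0))
```

For this event the Lorentz and Galilean results at 30 km/s differ by
parts in $10^9$, which is of the order of $\beta^2$.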

When $v_0 > c$, Eqs. (8.16) and (8.17) for x, t, x', and t' become
imaginary. This agrees with the fact that motion at a speed exceeding
that of light in a vacuum is impossible. It is impossible even to use
a reference frame moving with the speed c because when $v_0 = c$,
we get zero in the denominators of the equations for x and t.

The Lorentz transformations have an especially simple and symmetrical
form if we write them for x and ct instead of for x and t, i.e. for
quantities of the same dimension. In this case, Eqs. (8.16) have the
form

$$x = \frac{x' + \beta\,(ct')}{\sqrt{1 - \beta^2}}, \quad y = y', \quad z = z', \quad (ct) = \frac{(ct') + \beta x'}{\sqrt{1 - \beta^2}} \tag{8.18}$$


It is simple to memorize Eqs. (8.18) by bearing in mind that the
first of them differs from the “obvious” equation $x = x' + v_0 t'$
in containing in its denominator the expression $\sqrt{1 - \beta^2}$
characteristic of relativistic formulas. The last equation is obtained
from the first one if we change the places of x' and ct'.


8.3. Corollaries of the Lorentz Transformations

A number of corollaries follow from the Lorentz transformations 
that are unusual from the viewpoint of Newtonian mechanics. 

Simultaneity of Events in Different Reference Frames. Assume
that two events occur simultaneously in the frame K at points with
the coordinates $x_1$ and $x_2$ and at the moment $t_1 = t_2 = b$.
According to the last of the equations (8.17), the moments

$$t'_1 = \frac{b - (\beta/c)\,x_1}{\sqrt{1 - \beta^2}}, \qquad t'_2 = \frac{b - (\beta/c)\,x_2}{\sqrt{1 - \beta^2}}$$

will correspond to these events in the frame K'. Examination of these
equations shows that if the events occur at different points of space
($x_1 \neq x_2$) in the frame K, then they will not be simultaneous in
the frame K' ($t'_1 \neq t'_2$). The sign of the difference
$t'_2 - t'_1$ is determined by that of the expression
$(\beta/c)(x_1 - x_2)$. Consequently, in different frames K' (with
different $\beta$’s), the difference $t'_2 - t'_1$ will vary in
magnitude and may differ in sign. This signifies that in some frames
event 1 will precede event 2, whereas in others, on the contrary,
event 2 will precede event 1. It must be noted that what has been
said above relates only to events between which there is no causal
relationship. Causally related events (for example, the throwing
of a stone and its falling onto the Earth) will not be simultaneous
in any reference frame, and in all frames the event that is the cause
will precede the effect. This will be treated in greater detail in the
following section.
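The change of sign of $t'_2 - t'_1$ is easy to see in numbers. The
Python sketch below evaluates the difference of the two moments given
above for frames K' moving in opposite directions; the function name and
the sample coordinates are illustrative assumptions.

```python
import math

C = 299_792_458.0  # speed of light in a vacuum, m/s

def time_difference_in_K_prime(x1, x2, beta):
    """t2' - t1' for two events that are simultaneous in K and occur at
    the points x1 and x2 (from the last of Eqs. (8.17))."""
    return (beta / C) * (x1 - x2) / math.sqrt(1.0 - beta**2)

x1, x2 = 0.0, 3.0e5  # illustrative: events 300 km apart along the x-axis
for beta in (-0.5, 0.0, 0.5):
    print(beta, time_difference_in_K_prime(x1, x2, beta))
```

For $\beta > 0$ the difference is negative (event 2 precedes event 1 in
K'), for $\beta < 0$ it is positive, and for events at the same point it
vanishes in every frame.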

The Length of Bodies in Different Frames. Let us consider a rod
arranged along the x'-axis and at rest relative to the reference frame
K' (Fig. 8.3). Its length in this frame is $l_0 = x'_2 - x'_1$, where
$x'_1$ and $x'_2$ are the coordinates of the rod ends that do not change
with the time t'. The rod travels with the velocity $v = v_0$ relative
to the frame K. To determine its length in this frame, we must note the
coordinates of the rod ends $x_1$ and $x_2$ at the same moment
$t_1 = t_2 = b$. Their difference $l = x_2 - x_1$ will give the length
of the rod measured in the frame K. To find the relationship between
$l_0$ and $l$, we must take the equation of the Lorentz transformations
that contains x', x, and t, i.e. the first of the equations (8.17).
Substituting $v_0/c$ for $\beta$ in this equation, we obtain

$$x'_1 = \frac{x_1 - v_0 b}{\sqrt{1 - v_0^2/c^2}}, \qquad x'_2 = \frac{x_2 - v_0 b}{\sqrt{1 - v_0^2/c^2}}$$

whence

$$x'_2 - x'_1 = \frac{x_2 - x_1}{\sqrt{1 - v_0^2/c^2}}$$

Fig. 8.3

Using the symbols $l$ and $l_0$ and also replacing the relative
velocity of the reference frames $v_0$ with the velocity v of the rod
relative to the frame K that equals it, we arrive at the expression

$$l = l_0\sqrt{1 - v^2/c^2} \tag{8.19}$$

Thus, the length of the rod $l$ measured in a frame relative to
which it is moving is shorter than the length $l_0$ measured in the
frame relative to which the rod is at rest.*

If a rod of length $l_0 = x_2 - x_1$ is at rest relative to the frame K,
then to determine its length in the frame K' we must note the
coordinates of its ends $x'_1$ and $x'_2$ at the same moment
$t'_1 = t'_2 = b$. The difference $l = x'_2 - x'_1$ gives the length of
the rod in the frame K' relative to which it is moving with the
velocity v. Using the first of the equations (8.16), we again arrive at
Eq. (8.19).
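The measuring procedure described above can be imitated numerically:
we find where each end of a rod fixed in K' is located in K at one and
the same moment t = b and take the difference. The Python sketch below
does this by solving the first of Eqs. (8.17) for x; the function names
and the sample values are illustrative assumptions.

```python
import math

C = 299_792_458.0  # speed of light in a vacuum, m/s

def end_position_in_K(x_prime, b, v0):
    """Coordinate x in K, at the moment t = b of K, of a rod end fixed
    at x' in K' (the first of Eqs. (8.17) solved for x)."""
    return x_prime * math.sqrt(1.0 - v0**2 / C**2) + v0 * b

def contracted_length(l0, v):
    """Eq. (8.19)."""
    return l0 * math.sqrt(1.0 - v**2 / C**2)

x1_p, x2_p = 2.0, 3.0    # rod ends in K', proper length 1 m (illustrative)
v0, b = 0.6 * C, 1.0e-6  # relative speed and common moment in K

l = end_position_in_K(x2_p, b, v0) - end_position_in_K(x1_p, b, v0)
print(l, contracted_length(x2_p - x1_p, v0))
```

The term $v_0 b$ is the same for both ends and cancels in the
difference, which is why the result does not depend on the moment b
chosen for the measurement.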

It must be noted that the dimensions of the rod are identical in 
all the reference frames in the direction of the axes y and z . 

* The length $l_0$ measured in the frame relative to which the rod is at
rest is called the proper length of the rod.

Thus, the dimensions of moving bodies contract in the direction of
their motion, and the contraction is the greater, the higher the
velocity. This phenomenon is called the Lorentz (or Fitzgerald)
contraction. It is interesting to note that the change in the shape of
bodies even at velocities comparable with c cannot be detected visually
(or in a photograph). The reason is very simple. When observing visually
or photographing a body, we register light pulses from different
points of the body that reach the retina of our eye or the photographic
plate simultaneously. These pulses, however, are not emitted
simultaneously. The pulses from the more remote sections were emitted
before those from the nearer sections. Thus, if the body is moving,
a distorted image of it is formed on the retina of the eye or on the
photograph. The relevant calculations show that this distortion
will result in compensation of the Lorentz contraction*, so that the
bodies seem to be only turned instead of distorted. Consequently,
a spherically shaped body even at high velocities will be perceived
visually as a body with a spherical configuration.

Length of Time Between Events. Let us suppose that two events
occur at the same point of the frame K'. The coordinate $x'_1 = a$ and
the moment $t'_1$ correspond to the first event in this frame, and
the coordinate $x'_2 = a$ and the moment $t'_2$ to the second one.
According to the last of the equations (8.16), the moments corresponding
to these events in the frame K will be (we have introduced $v_0/c$
instead of $\beta$)

$$t_1 = \frac{t'_1 + (v_0/c^2)\,a}{\sqrt{1 - v_0^2/c^2}}, \qquad t_2 = \frac{t'_2 + (v_0/c^2)\,a}{\sqrt{1 - v_0^2/c^2}}$$

Hence,

$$t_2 - t_1 = \frac{t'_2 - t'_1}{\sqrt{1 - v_0^2/c^2}}$$

Introducing the notation $t_2 - t_1 = \Delta t$ and
$t'_2 - t'_1 = \Delta t'$, we get the equation

$$\Delta t = \frac{\Delta t'}{\sqrt{1 - v_0^2/c^2}} \tag{8.20}$$

that relates the lengths of time between two events measured in the
frames K and K'. We remind our reader that in the frame K' both
events occur at the same point, i.e. $x'_1 = x'_2$.

Assume that both events occur with the same particle that is
at rest in the frame K' and is moving relative to the frame K with
the velocity $v = v_0$. Then $\Delta t'$ can be interpreted as the
length of time measured on a clock that is stationary relative to the
particle, or, in other words, measured on a clock that is moving
together with the particle (we have in mind motion relative to the
frame K). The time measured on a clock moving together with a body
is called the proper time of this body and is usually denoted by the
symbol $\tau$. Thus, $\Delta t' = \Delta\tau$. We can thus write
Eq. (8.20) as follows:

$$\Delta\tau = \Delta t\,\sqrt{1 - v^2/c^2} \tag{8.21}$$

(we have replaced the relative velocity of the reference frames $v_0$
with the velocity of the particle v equal to it).

Equation (8.21) relates the proper time of a body $\tau$ to the time
t read on a clock of a reference frame relative to which the body is
moving with the velocity v (this clock itself is moving relative to the
body with the velocity $-v$).

* If there were no Lorentz contraction, rapidly moving bodies ought to
seem extended in the direction of their motion.

A glance at Eq. (8.21) shows that the proper time is always smaller
than the time measured on a clock moving relative to a body (in
the latter case the effect called time dilation is observed). We shall
show in the following section that the proper time is an invariant
(i.e. is identical in all reference frames).

Considering the events occurring with the particle in the frame K,
we can define $\Delta t$ as the length of time measured on a stationary
clock, and $\Delta\tau$ as the length of time measured on a clock moving
with the velocity v. By Eq. (8.21), we have $\Delta\tau < \Delta t$. We
can therefore say that the moving clock runs slower than the clock at
rest (it must not be forgotten that in all respects except for their
velocity the clocks are absolutely identical).

Equation (8.21) has been directly confirmed experimentally. Cos- 
mic rays contain particles called mu-mesons or muons. These parti- 
cles are unstable and decay spontaneously into an electron (or posi- 
tron) and two neutrinos. The mean lifetime of muons measured in 
conditions when they are stationary (or are moving with a low veloc- 
ity) is about 2 X 10 “ 6 s. It would seem that even when travelling 
with the speed of light, muons could cover a distance of only about 
600 m. As observations show, however, muons are formed in cosmic 
rays at an altitude of from 20 to 30 km, and a considerable number 
of them manage to reach the Earth’s surface. The explanation is 
that 2 X 10” 6 s is the proper lifetime of a muon, i.e. time measured 
on a clock travelling together with it. The time according to the 
clock of an observer on the Earth will be much greater [see Eq. (8.21); 
v of a muon is close to c]. It is therefore not surprising that the observ- 
er registers a distance travelled by a muon much greater than 600 m. 
We must note that from the position of an observer travelling togeth- 
er with a muon, the distance it covers to the Earth’s surface con- 
tracts to 600 m [see Eq. (8.19)], so that the muon manages to travel 
this distance in $2 \times 10^{-6}$ s.
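
The muon estimate can be checked numerically. What follows is a minimal
Python sketch rather than part of the original argument. It assumes the
rounded value c = 3 x 10^8 m/s and an illustrative muon speed v = 0.9995c;
this speed is an assumption made for the example (the text only states that
v is close to c), and the function names are hypothetical.

```python
import math

C = 3.0e8      # speed of light, m/s (rounded value)
TAU0 = 2.0e-6  # proper mean lifetime of a muon, s (figure quoted in the text)


def gamma(beta):
    """Factor 1/sqrt(1 - v^2/c^2) appearing in Eq. (8.21), with beta = v/c."""
    return 1.0 / math.sqrt(1.0 - beta ** 2)


def earth_frame_lifetime(beta):
    """Mean lifetime read on the clock of an observer on the Earth, s."""
    return TAU0 * gamma(beta)


def earth_frame_path(beta):
    """Mean distance covered by the muon in the Earth frame, m."""
    return beta * C * earth_frame_lifetime(beta)


BETA = 0.9995  # assumed illustrative speed, not taken from the text
print(f"gamma       = {gamma(BETA):.1f}")
print(f"lifetime    = {earth_frame_lifetime(BETA):.2e} s")
print(f"path length = {earth_frame_path(BETA) / 1000:.1f} km")
```

With the assumed speed the dilation factor is about 32, so the mean path in
the Earth frame comes to roughly 19 km instead of 600 m.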


8.4. Interval 

We pointed out in Sec. 8.1 that a world point with the coordinates
$ct$, $x$, $y$, $z$ can be associated with every event in imaginary
four-dimensional space. Let one event have the coordinates $ct_1$, $x_1$,
$y_1$, $z_1$, and another the coordinates $ct_2$, $x_2$, $y_2$, $z_2$. We
shall introduce the notation $t_2 - t_1 = \Delta t$, $x_2 - x_1 = \Delta x$,
etc.

We remind our reader that owing to the qualitative distinction
between time and space, the square of the difference between the
time coordinates $(c\,\Delta t)^2$ and the squares of the differences
between the space coordinates $\Delta x^2$, $\Delta y^2$, $\Delta z^2$ enter
the expression for the square of the "distance" between events (more
exactly, between the world points corresponding to the events) with
opposite signs:

$$\Delta s^2 = c^2 \Delta t^2 - \Delta x^2 - \Delta y^2 - \Delta z^2 \tag{8.22}$$

The quantity determined by this equation is defined as the 
interval between events. 

Introducing the distance $\Delta l = \sqrt{\Delta x^2 + \Delta y^2 + \Delta z^2}$
between the points of conventional three-dimensional space at which the
events being considered occurred, the expression for the interval can be
written in the form

$$\Delta s = \sqrt{c^2 \Delta t^2 - \Delta l^2} \tag{8.23}$$

It is easy to convince ourselves that the interval between two
given events is the same in all inertial reference frames. It is exactly
this circumstance that served as the grounds to consider it the analogue
of the distance $\Delta l$ between two points in conventional
three-dimensional space ($\Delta l$ does not change its value when we pass
over from one three-dimensional reference frame to another).

Assume that in the reference frame K the square of the interval 
is determined by Eq. (8.22). The square of the interval between 
the same events in the frame K' is 

$$\Delta s'^2 = c^2 \Delta t'^2 - \Delta x'^2 - \Delta y'^2 - \Delta z'^2 \tag{8.24}$$


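By Eqs. (8.17),

$$\Delta x' = \frac{\Delta x - \beta c\,\Delta t}{\sqrt{1 - \beta^2}}, \quad \Delta y' = \Delta y, \quad \Delta z' = \Delta z, \quad \Delta t' = \frac{\Delta t - (\beta/c)\,\Delta x}{\sqrt{1 - \beta^2}}$$

Introducing these values into Eq. (8.24), after simple transformations
we find that $\Delta s'^2 = c^2 \Delta t^2 - \Delta x^2 - \Delta y^2 - \Delta z^2$,
i.e. that

$$\Delta s'^2 = \Delta s^2$$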


The interval is thus invariant with respect to a transition from
one inertial reference frame to another. We saw in the preceding section
that the lengths of time $\Delta t$ and lengths $\Delta l$ are not invariant
with respect to such a transition. Hence, each of the addends forming
the quantity $\Delta s^2 = c^2 \Delta t^2 - \Delta l^2$ changes in a
transition from one frame to another; the quantity $\Delta s^2$ itself
remains unchanged.
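
The invariance of the interval can also be verified numerically. The
Python sketch below is only an illustration under stated assumptions: the
two events are separated by arbitrary sample values, c is rounded to
3 x 10^8 m/s, and the function names `boost` and `interval_squared` are
hypothetical. The frame K' is taken to move along the x-axis with the
velocity $v_0 = \beta c$, as in Eqs. (8.17).

```python
import math

C = 3.0e8  # speed of light, m/s (rounded value)


def boost(dt, dx, dy, dz, beta):
    """Differences between two events in K' (Lorentz transformation along x)."""
    g = 1.0 / math.sqrt(1.0 - beta ** 2)
    dx_p = g * (dx - beta * C * dt)
    dt_p = g * (dt - (beta / C) * dx)
    return dt_p, dx_p, dy, dz


def interval_squared(dt, dx, dy, dz):
    """Square of the interval between two events, Eq. (8.22)."""
    return (C * dt) ** 2 - dx ** 2 - dy ** 2 - dz ** 2


event_gap = (1.0e-6, 120.0, 50.0, -30.0)  # arbitrary sample values: s, m, m, m
s2 = interval_squared(*event_gap)
for beta in (0.1, 0.5, 0.9):
    dt_p, dx_p, dy_p, dz_p = boost(*event_gap, beta)
    s2_p = interval_squared(dt_p, dx_p, dy_p, dz_p)
    print(f"beta={beta}: dt'={dt_p:.3e} s, dx'={dx_p:.1f} m, "
          f"s'^2={s2_p:.6e}, s^2={s2:.6e}")
```

The time and space differences come out different for every value of beta,
while the square of the interval agrees to within rounding error.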

The interval between two events occurring with a particle is in
a simple relation with the length of the proper time between these
events. By Eq. (8.21), the length of the proper time $\Delta\tau$ is related
to the time $\Delta t$ measured on a clock of the frame relative to which
the particle is travelling with the velocity v by the expression

$$\Delta\tau = \Delta t \sqrt{1 - \frac{v^2}{c^2}}$$

Let us transform this equation as follows:

$$\Delta\tau = \frac{1}{c} \sqrt{c^2 \Delta t^2 - (v\,\Delta t)^2} = \frac{1}{c} \sqrt{c^2 \Delta t^2 - \Delta l^2}$$

Here $\Delta l = v\,\Delta t$ is the distance travelled by the particle during
the time $\Delta t$. A comparison with Eq. (8.23) shows that

$$\Delta\tau = \frac{1}{c}\,\Delta s \tag{8.25}$$

where $\Delta s$ is the interval between events separated by the time
$\Delta t$.

It follows from Eq. (8.25) that the length of the proper time is 
proportional to the interval between events. The interval is an 
invariant. Consequently, the proper time is also an invariant, i.e. 
does not depend on the reference frame in which the motion of a given 
body is being observed. 

According to Eq. (8.23), the interval may be real (if $c\,\Delta t > \Delta l$)
or imaginary (if $c\,\Delta t < \Delta l$). In a particular case, the interval
may equal zero (if $c\,\Delta t = \Delta l$). The last case occurs for events
consisting in the emission of a light signal from the point $x_1$, $y_1$,
$z_1$ at the moment $t_1$ and in the arrival of this signal at the point
$x_2$, $y_2$, $z_2$ at the moment $t_2$. Since here $\Delta l = c\,\Delta t$,
the interval between the events equals zero.

Owing to its invariance, an interval that is real (or imaginary)
in a reference frame K will be real (or imaginary) in any other inertial
frame K'.

For a real interval, we have

$$c^2 \Delta t^2 - \Delta l^2 = c^2 \Delta t'^2 - \Delta l'^2 > 0$$

It can be seen from this expression that we can find a frame K' in
which $\Delta l' = 0$, i.e. both events will coincide in space. No reference
frame exists, however, in which $\Delta t' = 0$ (the interval would become
imaginary at such a value of $\Delta t'$). Thus, events separated by a real
interval cannot become simultaneous in any reference frame. For
this reason, real intervals are called timelike.

We must note that events occurring with the same particle (we
have in mind a particle with a rest mass differing from zero) can be
separated only by a timelike interval. Indeed, the velocity of such
a particle v is always lower than c. Hence, the path $\Delta l$ travelled
by the particle is less than $c\,\Delta t$, whence it follows that
$\Delta s^2 > 0$. According to the last of the equations (8.17), we have

$$\Delta t' = \frac{\Delta t - (\beta/c)\,\Delta x}{\sqrt{1 - \beta^2}} \tag{8.26}$$

If $\Delta t$ and $\Delta x$ separate events occurring with the same particle,
then $\Delta x/\Delta t$ gives the component $v_x$ of the particle's velocity.
Therefore, Eq. (8.26) in this condition can be written in the form

$$\Delta t' = \frac{\Delta t - (\beta/c)(\Delta x/\Delta t)\,\Delta t}{\sqrt{1 - \beta^2}} = \frac{\Delta t}{\sqrt{1 - \beta^2}} \left( 1 - \beta\,\frac{v_x}{c} \right)$$

Since both $\beta = v_0/c$ and $v_x/c$ are less than unity, the quantity in
parentheses in the right-hand side of the equation is positive for
all frames K'. Hence, it follows that $\Delta t'$ and $\Delta t$ have the same
signs.
This signifies that two events occurring with a particle take place 
in the same sequence in all frames. For example, the birth of a par- 
ticle in all reference frames precedes its decay. 

For an imaginary interval, we have

$$c^2 \Delta t^2 - \Delta l^2 = c^2 \Delta t'^2 - \Delta l'^2 < 0$$

This shows that we can find a frame K' in which $\Delta t' = 0$, i.e. both
events occur at the same moment. No reference frame exists, however,
in which we would have $\Delta l' = 0$ (the interval would be real
with such a value of $\Delta l'$). Thus, events separated by an imaginary
interval cannot coincide in space in any reference frame. For this
reason, imaginary intervals are called spacelike.

The distance $\Delta l$ between points at which events separated by a
spacelike interval occur exceeds $c\,\Delta t$. Therefore, these events
cannot in any way affect each other, i.e. cannot be causally related to
each other (we remind our reader that no actions exist which propagate
at a velocity exceeding c).

Causally related events can be separated only by a timelike or 
a zero interval. 

8.5. Transformation and Addition 
of Velocities 


Let us consider the motion of a point particle. The position of
the particle in the frame K is determined at each moment t by the
coordinates x, y, z. The expressions

$$v_x = \frac{dx}{dt}, \quad v_y = \frac{dy}{dt}, \quad v_z = \frac{dz}{dt}$$

are the projections of the vector of the particle's velocity relative
to the frame K onto the axes x, y, z. The position of the particle in
the frame K' is characterized at each moment t' by the coordinates
x', y', z'. The projections of the vector of the particle's velocity
relative to the frame K' onto the axes x', y', z' are determined by
the expressions

$$v'_x = \frac{dx'}{dt'}, \quad v'_y = \frac{dy'}{dt'}, \quad v'_z = \frac{dz'}{dt'}$$

From Eqs. (8.16), we have

$$dx = \frac{dx' + v_0\,dt'}{\sqrt{1 - v_0^2/c^2}}, \quad dy = dy', \quad dz = dz', \quad dt = \frac{dt' + (v_0/c^2)\,dx'}{\sqrt{1 - v_0^2/c^2}}$$

(we have replaced $\beta$ with $v_0/c$). Dividing the first three equations
by the fourth one, we get formulas for transformation of the
velocities when passing over from one reference frame to another:


$$v_x = \frac{v'_x + v_0}{1 + v_0 v'_x/c^2}, \quad v_y = \frac{v'_y \sqrt{1 - v_0^2/c^2}}{1 + v_0 v'_x/c^2}, \quad v_z = \frac{v'_z \sqrt{1 - v_0^2/c^2}}{1 + v_0 v'_x/c^2} \tag{8.27}$$

When $v_0 \ll c$, equations (8.27) become the same as the velocity
addition equations (2.20) of classical mechanics.
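
A short numerical illustration of the velocity transformation may be
helpful here. The Python sketch below is not taken from the course; the
function names are hypothetical, c is rounded to 3 x 10^8 m/s, and the
sample velocities are arbitrary assumptions. It applies the three formulas
for the components $v_x$, $v_y$, $v_z$ to an everyday case, to two large
collinear velocities, and to a light signal emitted along the y'-axis.

```python
import math

C = 3.0e8  # speed of light, m/s (rounded value)


def transform_velocity(v_prime, v0):
    """Components (vx, vy, vz) in K from (v'x, v'y, v'z) in K', Eqs. (8.27)."""
    vx_p, vy_p, vz_p = v_prime
    denom = 1.0 + v0 * vx_p / C ** 2
    root = math.sqrt(1.0 - (v0 / C) ** 2)
    return ((vx_p + v0) / denom, vy_p * root / denom, vz_p * root / denom)


def speed(v):
    """Magnitude of a velocity given by its three components."""
    return math.sqrt(sum(comp * comp for comp in v))


samples = [
    ((30.0, 40.0, 0.0), 20.0),          # everyday velocities, m/s
    ((0.5 * C, 0.0, 0.0), 0.5 * C),     # two large collinear velocities
    ((0.0, C, 0.0), 0.6 * C),           # light signal along the y'-axis
]
for v_prime, v0 in samples:
    v = transform_velocity(v_prime, v0)
    print(f"v0/c={v0 / C:.3g}: vx/c={v[0] / C:.6g}, vy/c={v[1] / C:.6g}, "
          f"|v|/c={speed(v) / C:.6g}")
```

For the everyday case the result is indistinguishable from the classical
sum, two velocities of 0.5c combine to 0.8c, and the light signal keeps
the speed c in the frame K although its direction changes.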

It is simple to obtain expressions for velocities in the frame K' 
through the velocities in the frame K from Eqs. (8.17): 


$$v'_x = \frac{v_x - v_0}{1 - v_0 v_x/c^2}, \quad v'_y = \frac{v_y \sqrt{1 - v_0^2/c^2}}{1 - v_0 v_x/c^2}, \quad v'_z = \frac{v_z \sqrt{1 - v_0^2/c^2}}{1 - v_0 v_x/c^2} \tag{8.28}$$

These equations differ from equations (8.27) only in the sign before
$v_0$. This result could naturally be predicted.

If a body is travelling parallel to the x-axis, its velocity v relative
to the frame K coincides with $v_x$, and its velocity v' relative to
the frame K' coincides with $v'_x$. In this case, the law of velocity
addition has the form

$$v = \frac{v' + v_0}{1 + v_0 v'/c^2} \tag{8.29}$$

Assume that the velocity v' equals c. Then Eq. (8.29) gives us the
following value for v:

$$v = \frac{c + v_0}{1 + v_0 c/c^2} = c$$

This result is not surprising because the Lorentz transformations
(and, consequently, the velocity addition equations too) are based
on the assertion that the speed of light is the same in all reference
frames. Assuming in Eq. (8.29) that $v' = v_0 = c$, we also get a value
of c for v. Thus, if the velocities v' and $v_0$ being added do not exceed
c, then the resultant velocity v also cannot exceed c.


8.6. Relativistic Expression
for the Momentum

Newton’s equations are invariant with respect to the Galilean 
transformations (see Sec. 2.7). They are not invariant, however, 
with respect to the Lorentz transformations. In particular, the law 
of momentum conservation (see Sec. 3.10) following from Newton’s 
laws is not invariant with respect to the Lorentz transformations. 
To convince ourselves of the truth of this statement, let us see what
a completely inelastic collision of two identical balls of mass m
is like in the frames K and K' (Fig. 8.4).




Fig. 8.4 [diagram: the two balls of mass m before the collision as seen in
the frames K and K']


Assume that the balls are moving toward each other in the frame K
along the x-axis with velocities identical in magnitude whose projections
onto the x-axis are $v_{x1} = v_0$ and $v_{x2} = -v_0$ ($v_0$ is the relative
velocity of the frames K and K'). In these conditions, the balls will
be at rest after colliding: $v_{x1} = v_{x2} = 0$. Thus, the total momentum
of the system both before and after the collision equals zero, i.e. the
momentum in the frame K is conserved.

Let us now consider the same process in the frame K'. Using the
first of the equations (8.28), we find for the velocities of the balls
before they collide the values $v'_{x1} = 0$ and
$v'_{x2} = -2v_0/(1 + v_0^2/c^2)$, and for the velocities of the balls after
they collide the same value $v'_{x1} = v'_{x2} = -v_0$. Therefore, the total
momentum before the collision is $-2mv_0/(1 + v_0^2/c^2)$, and after the
collision is $-2mv_0$. If $v_0 \ll c$, the momentum of the system before
and after the collision is virtually the same. When the balls are
travelling with a great velocity $v_0$, however, the difference between
the initial and the final momenta becomes quite appreciable. Thus, using
the Newtonian expression for the momentum, we arrived at the conclusion
that the momentum does not seem to be conserved in the frame K'.
One of the fundamental laws of mechanics, the law of momentum
conservation, is not invariant with respect to the Lorentz
transformations in the Newtonian formulation.
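
The size of the discrepancy is easy to tabulate. The following Python
sketch is an illustration only (hypothetical function names, rounded
c = 3 x 10^8 m/s, an arbitrary sample mass of 1 kg). It transforms the
velocities of the balls into the frame K' with the first of the equations
(8.28) and compares the Newtonian momentum mv' summed over the balls before
and after the collision.

```python
C = 3.0e8  # speed of light, m/s (rounded value)


def to_k_prime(v_x, v0):
    """First of Eqs. (8.28): x-component of a velocity in the frame K'."""
    return (v_x - v0) / (1.0 - v0 * v_x / C ** 2)


def newtonian_momenta_in_k_prime(m, v0):
    """Total Newtonian momentum in K' before and after the collision."""
    before = m * to_k_prime(v0, v0) + m * to_k_prime(-v0, v0)
    after = 2.0 * m * to_k_prime(0.0, v0)  # both balls are at rest in K
    return before, after


m = 1.0  # kg, arbitrary sample mass
for beta in (1e-4, 0.1, 0.5, 0.9):
    before, after = newtonian_momenta_in_k_prime(m, beta * C)
    print(f"v0/c={beta}: before={before:.6e}, after={after:.6e}, "
          f"before/after={before / after:.6f}")
```

The ratio of the two totals equals $1/(1 + v_0^2/c^2)$: it is practically
unity for small velocities and drops to 0.8 already at $v_0 = 0.5c$.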

It can be shown that the law of momentum conservation will be
invariant with respect to the Lorentz transformations at any velocities
if we substitute the proper time of a particle $\tau$ for the time t
in the classical expression

$$\mathbf{p} = m\mathbf{v} = m\,\frac{d\mathbf{r}}{dt} \tag{8.30}$$

Consequently, the relativistic expression for the momentum has the
form

$$\mathbf{p} = m\,\frac{d\mathbf{r}}{d\tau} \tag{8.31}$$

When $v \ll c$, the length of the proper time of a particle $d\tau$ does not
virtually differ from the length $dt$ measured according to the clock
of the frame in which the motion of the particle is being considered
[see Eq. (8.21)]. Hence, Eq. (8.31) transforms into the classical
expression (8.30).

Remember that dr in Eq. (8.31) is the displacement of the particle 
in the reference frame in which the momentum p is determined, 
whereas the length of time $d\tau$ is determined on a clock travelling
together with the particle. 

We get an expression for the momentum through the time t of the
frame of reference relative to which the motion of bodies is being
observed. By Eq. (8.21), we have $d\tau = dt \sqrt{1 - v^2/c^2}$, where v
is the velocity of the body. This substitution in Eq. (8.31) yields

$$\mathbf{p} = \frac{m\,d\mathbf{r}}{dt \sqrt{1 - v^2/c^2}}$$

or, since $d\mathbf{r}/dt = \mathbf{v}$:

$$\mathbf{p} = \frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}} \tag{8.32}$$


The mass m in Eq. (8.32) is invariant and, consequently, does not 
depend on the velocity of the body. 

It can be seen from Eq. (8.32) that the velocity dependence of the
momentum is more complicated than is assumed in Newtonian
mechanics. When $v \ll c$, Eq. (8.32) transforms into the Newtonian
expression $\mathbf{p} = m\mathbf{v}$.
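
To see how quickly the relativistic momentum departs from the Newtonian
one, the scalar form of Eq. (8.32) can be evaluated for several velocities.
This is a minimal Python sketch with hypothetical function names; the
rounded values of c and of the electron mass used as a sample are
assumptions, not data from the text.

```python
import math

C = 3.0e8  # speed of light, m/s (rounded value)


def relativistic_momentum(m, v):
    """Scalar form of Eq. (8.32): p = m v / sqrt(1 - v^2/c^2)."""
    return m * v / math.sqrt(1.0 - (v / C) ** 2)


def newtonian_momentum(m, v):
    """Classical expression p = m v."""
    return m * v


m = 9.11e-31  # kg, electron mass (rounded), used only as a sample value
for beta in (0.001, 0.1, 0.6, 0.9, 0.99):
    v = beta * C
    ratio = relativistic_momentum(m, v) / newtonian_momentum(m, v)
    print(f"v/c={beta}: p_rel/p_newton={ratio:.6f}")
```

The ratio does not depend on the mass: it is the factor
$1/\sqrt{1 - v^2/c^2}$, which equals 1.25 at v = 0.6c and grows without
limit as v approaches c.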

We must note that Eq. (8.32) permits the following interpretation
to be made, which is gradually losing favour. The momentum, as
in Newtonian mechanics, equals the product of the mass of a body
and its velocity:

$$\mathbf{p} = m_r \mathbf{v} \tag{8.33}$$

The mass of a body, however, is not a constant invariant quantity,
but depends on the velocity according to the law

$$m_r = \frac{m}{\sqrt{1 - v^2/c^2}} \tag{8.34}$$

In this interpretation, the invariant mass m is called the rest mass
(it is often denoted by the symbol $m_0$). The non-invariant mass $m_r$
depending on the velocity is called the relativistic mass.


8.7. Relativistic Expression 
for the Energy 

Newton’s second law states that the time derivative of the momen- 
tum of a particle (point particle) equals the resultant force acting 
on the particle [see Eq. (2.10)]. The equation of the second law is 
invariant relative to the Lorentz transformations if by the momentum 
we understand the quantity (8.32). Hence, the relativistic expres- 
sion of Newton’s second law has the form 


$$\frac{d}{dt} \left( \frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}} \right) = \mathbf{F} \tag{8.35}$$

It should be borne in mind that the equation ma = F cannot be 
used in the relativistic case, the acceleration a and the force F, 
generally speaking, being non-collinear. 

We shall note that neither the momentum nor the force are invar- 
iant quantities. Equations for the transformation of the momentum 
components when passing over from one inertial reference frame 
to another will be obtained in the following section. We give the 
equations for transformation of the force components without deriv- 
ing them: 

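$$F_x = \frac{F'_x + (\beta/c)\,\mathbf{F}'\mathbf{v}'}{1 + \beta (v'_x/c)}, \quad F_y = \frac{F'_y \sqrt{1 - \beta^2}}{1 + \beta (v'_x/c)}, \quad F_z = \frac{F'_z \sqrt{1 - \beta^2}}{1 + \beta (v'_x/c)} \tag{8.36}$$

(here $\beta = v_0/c$ and $\mathbf{v}'$ is the velocity of a particle in the
frame K'). If in the frame K' the force $\mathbf{F}'$ acting on a particle is
perpendicular to the velocity of the particle $\mathbf{v}'$, the scalar product
$\mathbf{F}'\mathbf{v}'$ equals zero, and the first of the equations (8.36) is
simplified as follows:

$$F_x = \frac{F'_x}{1 + \beta (v'_x/c)} \tag{8.37}$$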


To find the relativistic expression for the energy, let us proceed in
the same way as we did in Sec. 3.2. We shall multiply Eq. (8.35)
by the displacement of a particle $d\mathbf{s} = \mathbf{v}\,dt$. The result is

$$\frac{d}{dt} \left( \frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}} \right) \mathbf{v}\,dt = \mathbf{F}\,d\mathbf{s}$$

The right-hand side of this equation gives the work dA done on the
particle during the time dt. We saw in Sec. 3.2 that the work of the
resultant of all the forces is spent on an increment of the kinetic
energy of the particle [see Eq. (3.11)]. Consequently, the left-hand
side of the equation should be interpreted as the increment of the
kinetic energy of the particle during the time dt. Thus,

$$dE_k = \frac{d}{dt} \left( \frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}} \right) \mathbf{v}\,dt = \mathbf{v}\,d\!\left( \frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}} \right)$$

Let us transform the obtained expression, bearing in mind that
$\mathbf{v}\,d\mathbf{v} = d(v^2/2)$ [see Eq. (1.54)]:

$$dE_k = \mathbf{v} \left( \frac{m\,d\mathbf{v}}{\sqrt{1 - v^2/c^2}} + \frac{m\mathbf{v}\,(\mathbf{v}\,d\mathbf{v}/c^2)}{(1 - v^2/c^2)^{3/2}} \right) = \frac{m\,d(v^2/2)}{(1 - v^2/c^2)^{3/2}} = \frac{mc^2\,d(v^2/c^2)}{2 (1 - v^2/c^2)^{3/2}} = d\!\left( \frac{mc^2}{\sqrt{1 - v^2/c^2}} \right)$$

Integration of this expression yields

$$E_k = \frac{mc^2}{\sqrt{1 - v^2/c^2}} + \text{const} \tag{8.38}$$

According to the meaning of kinetic energy, it must vanish when
v = 0. We thus get a value of $-mc^2$ for the constant. Hence, the
relativistic expression for the kinetic energy of a particle has the
form

$$E_k = \frac{mc^2}{\sqrt{1 - v^2/c^2}} - mc^2 = mc^2 \left( \frac{1}{\sqrt{1 - v^2/c^2}} - 1 \right) \tag{8.39}$$

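For small velocities ($v \ll c$), Eq. (8.39) can be transformed as
follows:

$$E_k = mc^2 \left( \frac{1}{\sqrt{1 - v^2/c^2}} - 1 \right) \approx mc^2 \left( 1 + \frac{1}{2} \frac{v^2}{c^2} - 1 \right) = \frac{mv^2}{2}$$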

We have arrived at the Newtonian expression for the kinetic energy 
of a particle. This is what should be expected because for velocities 
much smaller than the speed of light all the equations of relativistic 
mechanics must transform into the relevant equations of Newto- 
nian mechanics. 
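
The passage to the Newtonian limit can be followed numerically. The Python
sketch below is an illustration under stated assumptions (hypothetical
function names, rounded c = 3 x 10^8 m/s, an arbitrary sample mass of
1 kg); it compares Eq. (8.39) with the Newtonian expression $mv^2/2$.

```python
import math

C = 3.0e8  # speed of light, m/s (rounded value)


def kinetic_energy(m, v):
    """Relativistic kinetic energy, Eq. (8.39)."""
    return m * C ** 2 * (1.0 / math.sqrt(1.0 - (v / C) ** 2) - 1.0)


def newtonian_kinetic_energy(m, v):
    """Newtonian kinetic energy m v^2 / 2."""
    return 0.5 * m * v ** 2


m = 1.0  # kg, arbitrary sample mass
for beta in (0.001, 0.01, 0.1, 0.5, 0.9):
    v = beta * C
    ratio = kinetic_energy(m, v) / newtonian_kinetic_energy(m, v)
    print(f"v/c={beta}: E_k(relativistic)/E_k(Newtonian)={ratio:.6f}")
```

For v = 0.01c the two expressions differ by less than one part in ten
thousand, whereas at v = 0.9c the relativistic value is several times
the Newtonian one.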

Let us consider a free particle (i.e. one that does not experience
the action of external forces) travelling with the velocity v. We have
learned that this particle has a kinetic energy determined by Eq. (8.39).
We have grounds, however (see below), to ascribe the additional
energy equal to

$$E_0 = mc^2 \tag{8.40}$$

to a free particle in addition to the kinetic energy (8.39). Thus, the
total energy of a free particle is determined by the expression
$E = E_k + E_0 = E_k + mc^2$. With a view to Eq. (8.39), we find that

$$E = \frac{mc^2}{\sqrt{1 - v^2/c^2}} \tag{8.41}$$

When v = 0, Eq. (8.41) transforms into Eq. (8.40). This is why
$E_0 = mc^2$ is called the rest energy. This energy is the internal energy
of a particle not associated with its motion as a whole. Equations 
(8.40) and (8.41) hold not only for an elementary particle, but also 
for a complicated body consisting of many particles. The energy 
E 0 of such a body includes, apart from the rest energies of its parti- 
cles, the kinetic energy of these particles (due to their motion rela- 
tive to the body’s centre of mass) and the energy of their interaction 
with one another. The rest energy, like the total* energy (8.41), does 
not include the potential energy of a body in an external force field. 

Eliminating the velocity v from Eqs. (8.32) and (8.41) [Eq. (8.32)
should be taken in the scalar form], we obtain an expression giving
the total energy of a particle through its momentum p:

$$E = c \sqrt{p^2 + m^2 c^2} \tag{8.42}$$

When $p \ll mc$, this equation can be written in the form

$$E = mc^2 \sqrt{1 + \left( \frac{p}{mc} \right)^2} \approx mc^2 \left[ 1 + \frac{1}{2} \left( \frac{p}{mc} \right)^2 \right] = mc^2 + \frac{p^2}{2m} \tag{8.43}$$

The expression obtained differs from Newton's equation for the
kinetic energy $E_k = p^2/2m$ in the addend $mc^2$.

It must be noted that the following equation results from a comparison
of Eqs. (8.32) and (8.41):

$$\mathbf{p} = \frac{E}{c^2}\,\mathbf{v} \tag{8.44}$$

We shall explain why the energy (8.41), and not only the kinetic 
energy (8.39), should be ascribed to a free particle. Energy according 
to its meaning must be a conserved quantity. The relevant treat- 
ment shows that when particles collide, the sum (for the particles) 
of expressions of the form of Eq. (8.41) is conserved, whereas the 
sum of Eqs. (8.39) is not conserved. It is impossible to comply with 
the requirement of energy conservation in all inertial reference 
frames if we do not include the rest energy (8.40) in the total energy. 

In addition, we succeed in forming an invariant, i.e. a quantity 
that does not change in the Lorentz transformations, from Eq. (8.41) 
for the energy and (8.42) for the momentum. Indeed, it can be seen 
from Eq. (8.42) that 

$$\frac{E^2}{c^2} - p^2 = m^2 c^2 = \text{inv} \tag{8.45}$$

(we remind our reader that the mass m and speed c are invariant
quantities). Experiments with fast particles confirm the invariance
of the quantity in Eq. (8.45). If by E in Eq. (8.45) we understand
the kinetic energy (8.39), then Eq. (8.45) will not be invariant.

* We shall note here that the term "total energy" has a different meaning
in relativistic mechanics than in Newtonian mechanics. In the latter, the total
energy is defined as the sum of the kinetic and potential energies of a particle.
In relativistic mechanics, by the total energy is meant the sum of the kinetic
and rest energies of a particle.
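
The statement about Eq. (8.45) can be illustrated with a few numbers. The
Python sketch below is not from the course: the function names are
hypothetical, c is rounded to 3 x 10^8 m/s, and the mass of 2 kg is an
arbitrary sample value. It evaluates the combination $E^2/c^2 - p^2$ for
several velocities, first with the total energy (8.41) and then with the
kinetic energy (8.39) in place of E.

```python
import math

C = 3.0e8  # speed of light, m/s (rounded value)


def momentum(m, v):
    """Scalar form of Eq. (8.32)."""
    return m * v / math.sqrt(1.0 - (v / C) ** 2)


def total_energy(m, v):
    """Total energy of a free particle, Eq. (8.41)."""
    return m * C ** 2 / math.sqrt(1.0 - (v / C) ** 2)


def kinetic_energy(m, v):
    """Kinetic energy, Eq. (8.39)."""
    return total_energy(m, v) - m * C ** 2


def energy_from_momentum(m, p):
    """Total energy through the momentum, Eq. (8.42)."""
    return C * math.sqrt(p ** 2 + (m * C) ** 2)


def invariant(energy, p):
    """The combination E^2/c^2 - p^2 of Eq. (8.45)."""
    return (energy / C) ** 2 - p ** 2


m = 2.0  # kg, arbitrary sample mass
for beta in (0.1, 0.5, 0.8, 0.99):
    v = beta * C
    p = momentum(m, v)
    with_total = invariant(total_energy(m, v), p) / (m * C) ** 2
    with_kinetic = invariant(kinetic_energy(m, v), p) / (m * C) ** 2
    print(f"v/c={beta}: total -> {with_total:.6f}, kinetic -> {with_kinetic:.6f}")
```

In units of $m^2 c^2$ the first column stays equal to unity whatever the
velocity, while the second one changes with the velocity and therefore
with the choice of the reference frame.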

Let us obtain another expression for the relativistic energy. From
Eq. (8.21), we find that

$$\frac{1}{\sqrt{1 - v^2/c^2}} = \frac{dt}{d\tau} \tag{8.46}$$

where dt is the time that elapses between two events occurring with
a particle and measured on a clock of the reference frame relative
to which the particle is travelling with the velocity v, while $d\tau$
is the same time measured on a clock travelling together with the
particle (proper time). Using Eq. (8.46) in Eq. (8.41) we get the
expression

$$E = mc^2\,\frac{dt}{d\tau} \tag{8.47}$$


We shall use this equation in the following section. 


8.8. Transformations of Momentum 
and Energy 


The total energy E and momentum p are not invariants. Indeed, 
both quantities depend on v, while the latter has different values 
in different reference frames. Let us see how the energy and momen- 
tum transform when we pass over from one reference frame to 
another. 

Consider an elementary displacement of a particle. Assume that
in the reference frame K this displacement occurs during the time dt,
and its components are dx, dy, dz. In the frame K', the same displacement
occurs during the time dt', and its components are dx', dy',
dz'. According to Eqs. (8.18), the following relations hold between
the lengths of time and the components of the displacement:

$$dx = \frac{dx' + \beta c\,dt'}{\sqrt{1 - \beta^2}}, \quad dy = dy', \quad dz = dz', \quad c\,dt = \frac{c\,dt' + \beta\,dx'}{\sqrt{1 - \beta^2}}$$

Let us multiply these equations by the mass of the particle m and
divide them by the proper time of the particle $d\tau$ corresponding to
the lengths of time dt and dt' (it should be remembered that the mass
and the proper time are invariant quantities, i.e. have the same
value in both frames). As a result, we get

$$m\,\frac{dx}{d\tau} = \frac{m (dx'/d\tau) + \beta mc (dt'/d\tau)}{\sqrt{1 - \beta^2}}, \quad m\,\frac{dy}{d\tau} = m\,\frac{dy'}{d\tau}, \quad m\,\frac{dz}{d\tau} = m\,\frac{dz'}{d\tau}, \quad mc\,\frac{dt}{d\tau} = \frac{mc (dt'/d\tau) + \beta m (dx'/d\tau)}{\sqrt{1 - \beta^2}} \tag{8.48}$$

By Eq. (8.31), we have $m (dx/d\tau) = p_x$, $m (dx'/d\tau) = p'_x$,
$m (dy/d\tau) = p_y$, etc. According to Eq. (8.47), we have
$mc (dt/d\tau) = E/c$, and $mc (dt'/d\tau) = E'/c$. Hence, Eqs. (8.48) can be
written in the form

$$p_x = \frac{p'_x + \beta (E'/c)}{\sqrt{1 - \beta^2}}, \quad p_y = p'_y, \quad p_z = p'_z, \quad \frac{E}{c} = \frac{(E'/c) + \beta p'_x}{\sqrt{1 - \beta^2}} \tag{8.49}$$


We have obtained equations by means of which the momentum
and energy of a particle are transformed when we pass over from
one inertial reference frame to another. These equations coincide
in form with Eqs. (8.18) used to transform the coordinates and time. To
facilitate a comparison, let us write Eqs. (8.18) and (8.49) side by side:

$$\begin{aligned}
x &= \frac{x' + \beta (ct')}{\sqrt{1 - \beta^2}}; & p_x &= \frac{p'_x + \beta (E'/c)}{\sqrt{1 - \beta^2}} \\
y &= y'; \quad z = z'; & p_y &= p'_y; \quad p_z = p'_z \\
(ct) &= \frac{(ct') + \beta x'}{\sqrt{1 - \beta^2}}; & \frac{E}{c} &= \frac{(E'/c) + \beta p'_x}{\sqrt{1 - \beta^2}}
\end{aligned} \tag{8.50}$$

It follows from the comparison that the components of the momentum 
behave in transformations like coordinates, and the energy like time. 
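
The transformation of the energy and momentum can be cross-checked against
the transformation of velocities. The Python sketch below is only an
illustration; its function names, the rounded value of c, the unit mass
and the sample velocity in K' are all assumptions. It finds the energy and
momentum in the frame K in two ways: from the velocity transformed by
Eqs. (8.27), and by applying Eqs. (8.49) to the energy and momentum in K'.

```python
import math

C = 3.0e8  # speed of light, m/s (rounded value)


def energy_momentum(m, vx, vy, vz):
    """(E, px, py, pz) of a particle from Eqs. (8.41) and (8.32)."""
    g = 1.0 / math.sqrt(1.0 - (vx ** 2 + vy ** 2 + vz ** 2) / C ** 2)
    return g * m * C ** 2, g * m * vx, g * m * vy, g * m * vz


def velocity_in_k(vx_p, vy_p, vz_p, v0):
    """Velocity components in K from those in K', Eqs. (8.27)."""
    denom = 1.0 + v0 * vx_p / C ** 2
    root = math.sqrt(1.0 - (v0 / C) ** 2)
    return (vx_p + v0) / denom, vy_p * root / denom, vz_p * root / denom


def to_frame_k(e_p, px_p, py_p, pz_p, beta):
    """Energy and momentum in K from those in K', Eqs. (8.49)."""
    g0 = 1.0 / math.sqrt(1.0 - beta ** 2)
    px = g0 * (px_p + beta * e_p / C)
    e = C * g0 * (e_p / C + beta * px_p)
    return e, px, py_p, pz_p


m, beta = 1.0, 0.5                 # sample mass (kg) and relative velocity v0/c
v_prime = (0.3 * C, 0.4 * C, 0.0)  # sample velocity of the particle in K'

direct = energy_momentum(m, *velocity_in_k(*v_prime, beta * C))
via_849 = to_frame_k(*energy_momentum(m, *v_prime), beta)
for name, a, b in zip(("E", "px", "py", "pz"), direct, via_849):
    print(f"{name}: from velocities = {a:.6e}, from Eqs. (8.49) = {b:.6e}")
```

Both routes give the same four numbers to within rounding, which is what
the side-by-side comparison of the equations leads one to expect.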

The analogy disclosed by Eqs. (8.50) allows us to present the 
mathematics of relativistic mechanics in the form of relations be- 
tween vectors in an imaginary four-dimensional space (four-vectors). 
We have already noted in Sec. 8.1 that we have to ascribe unusual 
properties to this space which differ from the properties of the 
Euclidean space we are accustomed to. In three-dimensional Eucli- 
dean space, the quantity 

$$\Delta l^2 = \Delta x^2 + \Delta y^2 + \Delta z^2$$

is an invariant, i.e. does not change upon rotations of the coordinate
axes. Unlike this, the quantity

$$c^2 \Delta t^2 + \Delta x^2 + \Delta y^2 + \Delta z^2 \tag{8.51}$$

is not invariant: it is not conserved upon transition from one inertial
reference frame to another (such a transition can be imagined
as rotation of the axes in four-dimensional space). Hence, the quantity
(8.51) does not have the properties of the square of the distance
between two world points. We have seen in Sec. 8.4 that Eq. (8.22),
i.e.

$$\Delta s^2 = c^2 \Delta t^2 - \Delta x^2 - \Delta y^2 - \Delta z^2$$

is invariant, and it should be considered as the square of the distance
between two points in the four-dimensional space we are interested
in*.

Having given four-dimensional space such properties, we can
consider the quantities $ct$, $x$, $y$, $z$ as the components of a four-vector
drawn from the origin of coordinates to the given world point.
Accordingly, $c\,\Delta t$, $\Delta x$, $\Delta y$, $\Delta z$ can be considered as
components of a four-vector, namely the displacement from one world point
to another. In three-dimensional Euclidean space, other vectors are dealt
with (velocity, acceleration, force, etc.) in addition to the position and
displacement vectors, and for any vector a, the quantity

$$a^2 = a_x^2 + a_y^2 + a_z^2$$

is an invariant. The components of any such vector transform upon 
rotation of the coordinate axes according to the same equations as 
the coordinates do. 

By analogy with three-dimensional vectors in Euclidean space, we 
can determine four-dimensional vectors. A four-dimensional vector 
or four-vector is defined as a combination of the four quantities 
$a_t$, $a_x$, $a_y$, $a_z$ that transform according to the same equations as
$ct$, $x$, $y$, $z$ [see the left-hand column of Eqs. (8.50)]. The "square"
of such a vector should be determined as

$$a_t^2 - a_x^2 - a_y^2 - a_z^2 \tag{8.52}$$

Since the components transform in the same way as the coordinates, 
expression (8.52) is invariant with respect to the Lorentz trans- 
formations. 

Inspection of Eqs. (8.50) shows that the combination of the quantities

$$E/c, \quad p_x, \quad p_y, \quad p_z \tag{8.53}$$

forms a four-vector. It is called the energy-momentum vector. An
expression such as (8.52) formed from the components (8.53), as
we have established [see Eq. (8.45)], is an invariant:

$$\left( \frac{E}{c} \right)^2 - p_x^2 - p_y^2 - p_z^2 = m^2 c^2$$


* Naturally, we can also consider Euclidean four-dimensional space. The
latter is not suitable for the needs of relativistic mechanics, however.


8.9. Relation Between Mass and Energy

Using the relativistic mass [see Eq. (8.34)], we can write Eq. (8.41)
in the form

$$E = m_r c^2 \tag{8.54}$$

It can be seen from this equation that the energy of a body and its
relativistic mass are always proportional to each other. Any change
in the energy of a body $\Delta E$ (except for a change in the potential
energy in an external force field) is attended by a change in the
relativistic mass of the body $\Delta m_r = \Delta E/c^2$, and, conversely, any
change in the relativistic mass $\Delta m_r$ is attended by a change in the
energy of the body

$$\Delta E = c^2 \Delta m_r \tag{8.55}$$

This statement is called the law of the relation between the relativistic
mass and energy*.

The proportionality between the relativistic mass and energy 
leads to the fact that the statement on the conservation of the total 
relativistic mass of particles is the statement on the conservation 
of the total energy using different words. In this connection, it is 
not customary practice to speak of the law of relativistic mass con- 
servation as of a separate law. 

Unlike the relativistic mass, the total rest mass of a system of 
interacting particles is not conserved. For example, upon an inelas- 
tic collision of two particles observed in the frame of their centre 
of mass, the rest mass of the particle formed is 


$$m_3 = m_1 + m_2 + \frac{E_{k,1} + E_{k,2}}{c^2}$$

where $m_1$ and $E_{k,1}$ are the rest mass and kinetic energy of the first
initial particle, and $m_2$ and $E_{k,2}$ are the relevant quantities of the
second particle. Thus,

$$m_3 > m_1 + m_2$$

In this case, the kinetic energy of the initial particles transformed 
into the internal energy of the formed particle. As a result, the rest 
mass of this particle exceeded the sum of the rest masses of the ini- 
tial particles. 

The operation of nuclear power plants is based on the chain reaction
of fission of nuclei of uranium ${}_{92}\mathrm{U}^{235}$ (or plutonium) when they
capture slow neutrons n**. Fission occurs in various ways. One of
the reactions is

$${}_{92}\mathrm{U}^{235} + n \rightarrow {}_{92}\mathrm{U}^{236} \rightarrow {}_{55}\mathrm{Cs}^{140} + {}_{37}\mathrm{Rb}^{94} + 2n \tag{8.56}$$

After capturing a neutron, a uranium nucleus decays into a cesium
nucleus with the mass number 140 and a rubidium nucleus with the
mass number 94. Two neutrons are also emitted. The total rest mass
of uranium-235 and a neutron exceeds the total rest mass of the
particles in the right-hand side of the reaction formula (8.56) by
about $4 \times 10^{-28}$ kg. The internal energy corresponding to this
surplus mass and equal to

$$E = c^2 \Delta m = (3 \times 10^8)^2 \times 4 \times 10^{-28} \approx 4 \times 10^{-11} \text{ J}$$

transforms into the kinetic energy of the particles formed (fission
fragments) and into the energy of electromagnetic radiation appearing
upon fission.

* We sometimes speak of the equivalence of mass and energy having in mind
their relation and proportionality to each other.

** The symbol ${}_{92}\mathrm{U}^{235}$ stands for the uranium isotope with a mass
number of 235. The nucleus of an atom of this isotope consists of 92 protons
and 235 - 92 = 143 neutrons. The symbol n stands for a neutron.
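
The arithmetic of this estimate can be reproduced and extended slightly.
The Python sketch below is an illustration, not part of the course. The
mass defect and the value of c are those used in the text; the conversion
factor from joules to MeV and the atomic mass unit are standard rounded
constants brought in as assumptions, and the function name is hypothetical.

```python
C = 3.0e8              # speed of light, m/s (rounded value used in the text)
DELTA_M = 4.0e-28      # kg, surplus rest mass quoted in the text
J_PER_MEV = 1.602e-13  # joules in one MeV (standard value, not from the text)
AMU = 1.66e-27         # kg, atomic mass unit (standard value, not from the text)


def released_energy(delta_m):
    """Energy corresponding to the rest mass difference: E = c^2 * delta_m."""
    return C ** 2 * delta_m


energy_j = released_energy(DELTA_M)
energy_mev = energy_j / J_PER_MEV
per_kg = energy_j / (235 * AMU)  # rough energy per kilogram of uranium-235

print(f"energy per fission: {energy_j:.2e} J = {energy_mev:.0f} MeV")
print(f"energy per kg of U-235 (rough estimate): {per_kg:.1e} J")
```

The unrounded product is 3.6 x 10^-11 J, a little over 200 MeV per fission
event, which the text rounds to 4 x 10^-11 J.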


8.10. Particles with a Zero Rest Mass 


Assuming in Eq. (8.42) that m equals zero, we get 

E = cp (8.57) 

This equation agrees with Eq. (8.44) only if v = c. Hence it follows
that a particle having a zero rest mass always travels with the
speed of light. Such particles include a light particle called a photon,
and also elementary particles called neutrinos.

The energy of a photon is determined by the equation

$$E = \hbar\omega \tag{8.58}$$

where $\hbar$ = Planck's constant h divided by $2\pi$
$\omega$ = cyclic frequency [see Eq. (7.58)].

According to Eqs. (8.57) and (8.58), a photon has the momentum

$$p = \frac{\hbar\omega}{c} \tag{8.59}$$


Light is a stream of photons. When light is absorbed or reflected 
from the surface of a body, a momentum is imparted to the latter. 
This manifests itself in the form of pressure exerted by the light on 
the body. P. Lebedev succeeded in discovering and measuring light 
pressure in 1900. The results of his measurements completely agreed 
with Eq. (8.59). 

According to Einstein's general theory of relativity, any object
having the energy E also has the gravitational mass

$$m_g = \frac{E}{c^2}$$

i.e. it should be attracted to other objects. Accordingly, a photon
should behave in a gravitational field like a particle of the
gravitational mass

$$m_g = \frac{\hbar\omega}{c^2} \tag{8.60}$$


Particularly, when moving vertically upward near the Earth's surface,
a photon must spend part of its energy on doing work against
the forces of gravity equal to

$$A = m_g g l = \frac{\hbar\omega g l}{c^2}$$

where l is the distance travelled. Accordingly, the initial energy
of a photon equal to $\hbar\omega$ must diminish by

$$\Delta E = \Delta (\hbar\omega) = \frac{\hbar\omega g l}{c^2}$$

Hence

$$\Delta\omega = \frac{\omega g l}{c^2}$$

We thus get the following expression for the relative reduction in
the frequency of a photon:

$$\frac{\Delta\omega}{\omega} = \frac{g l}{c^2} \tag{8.61}$$


The change in the frequency of a photon when propagating ver- 
tically was measured in 1959 by the U.S. scientists R. Pound and 
G. Rebka, Jr. Their result coincided with that calculated by 
Eq. (8.61) with an accuracy of 15%. We must note that in the conditions
of their experiment the relative change in the frequency had
a negligibly small value equal to $2 \times 10^{-15}$.

The effect of the change in the frequency of light when moving 
away from a large gravitating mass is called the gravitational 
red shift. The meaning of this term will be disclosed in the 
third volume of the present course. 
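
The order of magnitude of the effect follows directly from Eq. (8.61). In
the Python sketch below, which is an illustration only, the path length of
22.5 m is an assumption: it is the height commonly quoted for the Pound and
Rebka experiment and is not given in the text. The values of g and c are
rounded, and the function name is hypothetical.

```python
C = 3.0e8  # speed of light, m/s (rounded value)
G = 9.8    # free fall acceleration near the Earth's surface, m/s^2 (rounded)


def relative_frequency_shift(height):
    """Relative reduction in the photon frequency, Eq. (8.61): g*l/c^2."""
    return G * height / C ** 2


HEIGHT = 22.5  # m, assumed vertical path (commonly quoted figure, not from the text)
print(f"relative shift over {HEIGHT} m: {relative_frequency_shift(HEIGHT):.2e}")
print(f"relative shift over 1 km:    {relative_frequency_shift(1000.0):.2e}")
```

With the assumed height the shift comes to a few parts in 10^15, of the
same order as the value quoted above; even over a full kilometre it is
only about one part in 10^13.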



CHAPTER 9 HYDRODYNAMICS 


9.1. Streamlines and Flow Tubes.
Flow Continuity

Mechanics of continuous media exists in addition to the mechan- 
ics of a point particle and the mechanics of a rigid body which we 
treated in preceding chapters. This science covers hydrodynamics, 
gas dynamics, the theory of elasticity (some questions of which 
were dealt with in Secs. 2.9 and 3.8), and a number of other branches 
of science considering a substance as a continuous medium. Hydro- 
dynamics is the branch of mechanics of continuous media studying 
the motion of incompressible liquids 
and their interaction with solids. 

To describe the motion of a liquid, we can set the position of each
of its particles as a function of time. This method of description
was worked out by J. Lagrange. But it is also possible to observe
separate points of space instead of liquid particles and record the
velocity with which separate particles of the liquid pass each given
point. The second method is called the Euler method.

Fig. 9.1

The state of motion of a liquid can be determined by indicating 
the velocity vector as a function of time for each point of space. The 
combination of the vectors v given for all the points of space forms 
the so-called velocity vector field that can be depicted as follows. Let 
us draw lines in a flowing liquid so that a tangent to them at each 
point coincides in direction with the vector v (Fig. 9.1). These lines 
are called streamlines. We shall agree to draw the streamlines so that 
their density (characterized by the ratio of the number of lines $\Delta N$
to the magnitude of the area $\Delta S$ at right angles to them through
which they pass) is proportional to the magnitude of the velocity 
at the given place. The pattern of the streamlines will thus permit 
us to assess not only the direction, but also the magnitude of the 
vector v at different points of space: the streamlines will be closer 
together where the velocity is higher and, conversely, farther apart 
where the velocity is lower. 

Since the magnitude and the direction of the vector v may change
with time at every point, the pattern of the streamlines may
also change continuously. If the velocity vector is constant at each
point of space, the flow is called steady. In steady flow, any
particle of a liquid passes a given point of space with the same value
of v. The pattern of the streamlines in steady flow remains
unchanged, and the streamlines in this case coincide with the
trajectories of the particles.

A portion of a liquid confined by streamlines is called a flow tube. 
The vector v, being at each point tangent to a streamline, will also 
be tangent to the surface of the flow tube. Hence, the particles of 
the liquid in their motion do not intersect the “walls” of the flow 
tube. 

Let us take a cross section S of a flow tube (Fig. 9.2) at right angles
to the direction of the velocity. We shall assume that the velocity
of the liquid particles is the same at all points of this section.
During the time $\Delta t$, all the particles whose distance from S at
the initial moment did not exceed the value $v\,\Delta t$ will pass
through section S. Consequently, a volume of the liquid equal to
$Sv\,\Delta t$ will pass through section S during the time $\Delta t$,
and a volume of the liquid equal to Sv will pass through it in unit
time. Let us take a flow tube so thin that at each section of it the
velocity may be considered constant. If the liquid is incompressible
(i.e. its density is the same everywhere and cannot change), then the
amount of liquid between sections $S_1$ and $S_2$ (Fig. 9.3) will
remain constant. Hence, it follows that the volumes of liquid flowing
in a unit time through sections $S_1$ and $S_2$ must be the same:

$S_1 v_1 = S_2 v_2$

(we remind our reader that the particles of the liquid do not pass
through the side surface of a flow tube).

Fig. 9.2

The above reasoning may be applied to any pair of sections $S_1$
and $S_2$. Consequently, for an incompressible liquid, the quantity Sv
must be the same for any section of the same flow tube:

$Sv = \text{const}$    (9.1)

The result obtained forms the content of the theorem on flow con- 
tinuity. 
It can be seen from Eq. (9.1) that when a flow tube has a varying
section the particles of an incompressible liquid will move with
acceleration. In a horizontal flow tube (Fig. 9.4), this acceleration
can be due only to the pressure not being constant along the axis of
the tube: at places where the velocity is smaller, the pressure must
be greater, and vice versa. The quantitative relation between the
flow velocity and the pressure will be established in the following
section.

The theorem on flow continuity can 
be applied to real liquids and even 
to gases when their compressibility may be disregarded. The 
relevant calculations show that when fluids flow with velocities 
lower than the speed of sound, they may be considered incompres- 
sible with a sufficient degree of accuracy. 
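
The use of Eq. (9.1) can be illustrated by a short calculation. The sketch below assumes a flow tube that narrows from one section to another; the function name and the numerical values are hypothetical and serve only as an example.

```python
def outlet_velocity(s_in: float, v_in: float, s_out: float) -> float:
    """Velocity in section s_out found from S1*v1 = S2*v2, Eq. (9.1).

    Valid only for an incompressible liquid and for sections of the
    same flow tube.
    """
    if s_in <= 0 or s_out <= 0:
        raise ValueError("cross-sectional areas must be positive")
    return s_in * v_in / s_out


if __name__ == "__main__":
    s1 = 4.0e-4   # m^2, assumed area of the wide section
    s2 = 1.0e-4   # m^2, assumed area of the narrow section
    v1 = 0.5      # m/s, assumed velocity in the wide section
    print(outlet_velocity(s1, v1, s2))  # velocity in the narrow section, m/s
```

When the section is reduced to one-fourth, the velocity grows fourfold, so that the liquid must be accelerated on its way into the narrow part.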



9.2. Bernoulli’s Equation 


When dealing with the motion of liquids, we can often consider
that the displacement of some portions of a liquid relative to others
is not associated with the appearance of forces of friction. A liquid
in which internal friction (viscosity) is completely absent is called
ideal (or non-viscous).

Let us separate a flow tube of small cross section (Fig. 9.5) in a
steadily flowing ideal liquid. We shall consider the volume of the
liquid confined by the “walls” of the flow tube and by cross sections
$S_1$ and $S_2$ perpendicular to the streamlines. During the time
$\Delta t$, this volume will move along the flow tube. Section $S_1$
will move to position $S_1'$ having covered the distance $\Delta l_1$,
and section $S_2$ will move to position $S_2'$ having covered the
distance $\Delta l_2$. Owing to flow continuity, the hatched volumes
will be equal: $\Delta V_1 = \Delta V_2 = \Delta V$.

Fig. 9.5

The energy of each liquid particle consists of its kinetic energy
and its potential energy in the field of the Earth’s gravitational
forces. Owing to the steady nature of the flow, a particle that after
the time $\Delta t$ is at any point in the unhatched part of the volume
being considered (see, for example, point O in Fig. 9.5) has the same
velocity (and, consequently, kinetic energy) as the particle that was
at the same point at the initial moment. Hence, the energy
increment $\Delta E$ of the entire volume being considered can be
calculated as the difference between the energies of the small hatched
volumes $\Delta V_2$ and $\Delta V_1$.

Let us take a flow tube cross section and the lengths $\Delta l$ so
small that the same values of the velocity v, pressure p, and height h
can be ascribed to all the points of each of the hatched volumes.
Hence, the energy increment can be written as follows:

$\Delta E = \left(\frac{\rho\,\Delta V\,v_2^2}{2} + \rho\,\Delta V\,g h_2\right) - \left(\frac{\rho\,\Delta V\,v_1^2}{2} + \rho\,\Delta V\,g h_1\right)$    (9.2)

($\rho$ is the density of the liquid).

Forces of friction are absent in an ideal liquid. Therefore, the
energy increment (9.2) must equal the work done by the pressure
forces on the separated volume. The forces of pressure on the side
surface are perpendicular at each point to the direction of motion
of the particles to which they are applied; consequently, they do no
work. Only the work of the forces applied to sections $S_1$ and $S_2$
differs from zero. This work is

$A = p_1 S_1\,\Delta l_1 - p_2 S_2\,\Delta l_2 = (p_1 - p_2)\,\Delta V$    (9.3)

Equating Eqs. (9.2) and (9.3), cancelling $\Delta V$, and transferring
terms with the same subscripts to the same side of the equation,
we get

$\frac{\rho v_1^2}{2} + \rho g h_1 + p_1 = \frac{\rho v_2^2}{2} + \rho g h_2 + p_2$    (9.4)

Sections $S_1$ and $S_2$ were taken absolutely arbitrarily. We can
therefore assert that the expression $\rho v^2/2 + \rho g h + p$ has the
same value for any section of the flow tube. In accordance with the
assumptions we made in deriving Eq. (9.4), it becomes exact only in
the limit when the cross section S tends to zero, i.e. when the flow
tube contracts into a streamline. Thus, the quantities p, v and h in
the left-hand and right-hand sides of Eq. (9.4) should be considered
as relating to two arbitrary points of the same streamline.

The result obtained can be formulated as follows: the condition

$\frac{\rho v^2}{2} + \rho g h + p = \text{const}$    (9.5)

is observed in a steadily flowing ideal liquid along any streamline.
Equation (9.5) or Eq. (9.4) equivalent to it is called Bernoulli’s
equation, in honour of its discoverer, the Swiss mathematician
Daniel Bernoulli (1700-1782). Although we obtained this equation
for an ideal liquid, it is obeyed sufficiently well for real liquids in
which the internal friction is not very great.

Equation (9.4) acquires the following form for a horizontal
streamline:

$\frac{\rho v_1^2}{2} + p_1 = \frac{\rho v_2^2}{2} + p_2$

i.e. the pressure is smaller at the points where the velocity is
greater (this was already shown qualitatively in the preceding
section).

The diminishing of the pressure at points 
where the velocity of a flow is greater 
underlies the design of a water-jet pump 
(Fig. 9.6). A water stream is fed into a tube 
opening to the atmosphere so that the pres- 
sure at the outlet from the tube is atmo- 
spheric. The tube has a constriction through 
which the water flows with a higher veloc- 
ity. As a result, the pressure at this spot 
is below atmospheric. The same pressure sets in in the pump 
chamber surrounding the tube. The chamber communicates with 
the tube via an opening in its narrow part. By connecting a vessel 
to be evacuated to the pump chamber, we can pump the air (or 
some other gas) out of it to a pressure of the order of 100 mmHg.
The evacuated air is entrained by the water stream and carried off 
into the atmosphere. 
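
The lowering of the pressure in a constriction can be estimated by combining Eq. (9.1) with Eq. (9.4) written for a horizontal streamline. The sketch below is only an illustration for an ideal liquid: the function name, the density of water, and all the numerical values are assumptions and do not describe any real pump.

```python
RHO_WATER = 1000.0  # kg/m^3, assumed density of water


def constriction_pressure(p_out: float, v_out: float, s_out: float,
                          s_narrow: float, rho: float = RHO_WATER) -> float:
    """Pressure in the narrow part of a horizontal tube for an ideal liquid.

    p_out, v_out, s_out are the pressure, velocity and area at the outlet;
    s_narrow is the area of the constriction.
    """
    # Continuity, Eq. (9.1): the velocity grows where the section is smaller.
    v_narrow = v_out * s_out / s_narrow
    # Bernoulli's equation for a horizontal streamline:
    # rho*v_narrow^2/2 + p_narrow = rho*v_out^2/2 + p_out
    return p_out + 0.5 * rho * (v_out**2 - v_narrow**2)


if __name__ == "__main__":
    p_atm = 101325.0  # Pa, atmospheric pressure at the outlet
    p = constriction_pressure(p_atm, v_out=2.0, s_out=4.0e-4, s_narrow=1.0e-4)
    print(f"pressure in the constriction: {p:.0f} Pa")
```

For the values assumed here the pressure in the constriction is lower than atmospheric by about 30 kPa. A real liquid will give a somewhat different figure because of viscosity.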



9.3. Flow of a Liquid from a Hole 

Let us apply Bernoulli’s equation to the flow of a liquid from a
small hole in a wide open vessel. Let us separate in the liquid a flow
tube having the open surface of the liquid in the vessel as one of
its cross sections and the hole through which the liquid flows out*
as the other one (Fig. 9.7). For each of these sections, the velocity
and the height above an initial datum level may be considered the
same. Consequently, we can apply Eq. (9.4), obtained on this
assumption, to these sections. Further, the pressure in both sections
is atmospheric and therefore the same. In addition, the velocity of
the open surface in the wide vessel can be assumed to equal zero.
With a view to everything said above, Eq. (9.4) can be written
in the following form for this case:

$\rho g h_1 = \frac{\rho v^2}{2} + \rho g h_2$

where v is the velocity of the liquid flowing from the hole. Cancelling
$\rho$ and introducing $h = h_1 - h_2$, i.e. the height of the open surface
of the liquid above the hole, we get $v^2/2 = gh$, whence

$v = \sqrt{2gh}$    (9.6)

* More exactly, the cross section of the flow emerging from the hole. If
special measures are not taken, the section of the flow will be smaller than the
hole.

This formula is known as the Torricelli formula (after the Italian 
physicist Evangelista Torricelli, 1608-1647). 

Thus, the velocity with which a liquid is discharged from a hole
at a depth of h under an open surface coincides with the velocity
which a body acquires in falling from the height h. Do not forget
that this result was obtained on the assumption that the liquid is
ideal. The discharge velocity will be smaller for real liquids, the
difference from the value given by Eq. (9.6) growing with an
increasing viscosity of the liquid.

Fig. 9.7    Fig. 9.8
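
Equation (9.6) is easily evaluated numerically. The following minimal sketch assumes an ideal liquid; the function name and the depth of 0.5 m are chosen here only for illustration.

```python
import math


def discharge_velocity(depth_m: float, g: float = 9.81) -> float:
    """Torricelli formula, Eq. (9.6): v = sqrt(2*g*h) for an ideal liquid."""
    if depth_m < 0:
        raise ValueError("the hole must be below the open surface")
    return math.sqrt(2.0 * g * depth_m)


if __name__ == "__main__":
    h = 0.5  # m, assumed depth of the hole under the open surface
    print(f"v = {discharge_velocity(h):.2f} m/s")
```

The velocity does not depend on the density of the liquid, just as the velocity of a freely falling body does not depend on its mass.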

A stream of liquid discharged from a hole in a vessel (Fig. 9.8)
carries along with it during the time $\Delta t$ the momentum
$\Delta\mathbf{p} = \rho S v \mathbf{v}\,\Delta t$ ($\rho$ is the density of the
liquid, S is the cross-sectional area of the hole, $\mathbf{v}$ is the
discharge velocity of the flow). This momentum is imparted to the
discharged liquid by the vessel. According to Newton’s third law, the
vessel receives a momentum equal to $-\Delta\mathbf{p}$ from the discharged
liquid during the time $\Delta t$, i.e. experiences the action of the
force

$\mathbf{F}_r = -\frac{\Delta\mathbf{p}}{\Delta t} = -\rho S v \mathbf{v}$    (9.7)

This force is called the reaction of the discharged flow (or the thrust).
If our vessel is placed on a cart, then under the action of the force
$\mathbf{F}_r$ it will start moving in the direction opposite to that of
the discharged flow.

Let us find the value of the force $F_r$ using Eq. (9.6) for the discharge
velocity of a liquid from a hole:

$F_r = \rho S v^2 = 2gh\rho S$    (9.8)

If, as may seem at first sight, the force $F_r$ coincided in magnitude
with the force of hydrostatic pressure which the liquid would exert
on a plug closing the hole, then $F_r$ would equal $gh\rho S$. The force $F_r$
is actually double this value. The explanation is that the motion
of the liquid in the vessel appearing when it is discharged leads
to redistribution of the pressure, and the pressure near the wall
opposite the hole is somewhat greater than near the wall containing
the hole.
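
The factor of two mentioned above is readily seen in a numerical example. The sketch below compares the reaction of the flow (9.8) with the hydrostatic force on a plug; the function names and the values of the density, area, and depth are assumptions made for illustration.

```python
def jet_reaction_force(rho: float, area: float, depth: float,
                       g: float = 9.81) -> float:
    """Magnitude of the reaction of the discharged flow, Eq. (9.8)."""
    return 2.0 * g * depth * rho * area


def plug_force(rho: float, area: float, depth: float, g: float = 9.81) -> float:
    """Hydrostatic force g*h*rho*S on a plug closing the hole."""
    return g * depth * rho * area


if __name__ == "__main__":
    rho = 1000.0   # kg/m^3, assumed density (water)
    area = 1.0e-4  # m^2, assumed cross-sectional area of the hole
    depth = 0.5    # m, assumed depth of the hole
    f_r = jet_reaction_force(rho, area, depth)
    f_plug = plug_force(rho, area, depth)
    print(f"F_r = {f_r:.3f} N, plug force = {f_plug:.3f} N")
```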

The operation of jet engines and rockets is based on the reaction 
or thrust of a discharged gas stream. Reactive motion, not requir- 
ing an atmosphere for its accomplishment, is used for flights in 
outer space. 

The outstanding Russian scientist and inventor Konstantin 
Tsiolkovsky (1857-1935) founded the theory of interplanetary com- 
munications. He presented the theory of a rocket’s flight and sub- 
stantiated the possibility of using jet engines for interplanetary 
flights. In particular, Tsiolkovsky worked out the theory of motion 
of composite rockets in which each following stage starts function- 
ing after the preceding stage, having completely used up its fuel, 
separates from the rocket. Tsiolkovsky’s ideas were further devel- 
oped and realized by Soviet scientists and engineers who ensured 
the leading role of the Soviet Union in the mastering and studying 
of outer space. 


9.4. Forces of Internal Friction 

An ideal liquid, i.e. one without friction, is an abstraction. Vis- 
cosity or internal friction is a property inherent to some extent or 
other in all real fluids (liquids and gases). Viscosity manifests itself 
in that motion set up in a fluid gradually stops after the action of 
the reasons causing the motion is discontinued. 

Let us consider the following experiment to reveal the laws which 
forces of internal friction obey. Two parallel plates whose linear 
dimensions considerably exceed the distance d between them 
(Fig. 9.9) are immersed in a liquid. The bottom plate is held in place, 
while the top one is brought into motion relative to the bottom 
one with a certain velocity $v_0$. The experiment shows that to move
the top plate with a constant velocity $v_0$, we have to exert on it
a quite definite force F that is constant in magnitude. Since the
plate receives no acceleration, this signifies that the action of this
force is balanced by a force equal to it in magnitude and opposite
in direction, which is evidently the force of friction acting on the
plate when it moves in the liquid. Let us denote it by $F_{fr}$.

By varying the velocity of the plate $v_0$, the area of the plates S,
and the distance d between them, we can find that

$F_{fr} = \eta \frac{v_0}{d} S$    (9.9)

where $\eta$ is a constant of proportionality depending on the nature
and state (for instance, the temperature) of the liquid and called
the coefficient of internal friction or the viscosity of the liquid (gas).
Sometimes the quantity $\eta$ determined from Eq. (9.9) is called the
dynamic viscosity as distinct from the kinematic viscosity $\nu$ equal
to $\eta/\rho$, where $\rho$ is the density of the fluid (see Sec. 9.5).

The bottom plate upon motion of the top one also experiences
the action of the force $F_{fr}'$ equal in magnitude to $F_{fr}$. For the bottom
plate to remain stationary, the force $F_{fr}'$ must be balanced with the
aid of the force $F'$.

Thus, when the two plates immersed in the liquid move relative
to each other, interaction characterized by the force (9.9) appears
between them. The plates obviously act on each other through the
liquid between the plates, the force of interaction being transmitted
from one layer of the liquid to another. If at any place in the gap
between the plates we mentally draw a plane parallel to them (see
the dash line in Fig. 9.9), then we can assert that the part of the
liquid above this plane acts on the part of the liquid under it with
the force $F_{fr}'$, and the part of the liquid under the plane, in turn,
acts on the part above the plane with the force $F_{fr}$, the values of
$F_{fr}$ and $F_{fr}'$ being determined by Eq. (9.9). Thus, Eq. (9.9)
determines not only the force of friction acting on the plates, but also
the force of friction between the parts of the liquid in contact.
If we study the velocity of the liquid particles in different layers,
it will be found to change in the direction z at right angles to the
plates (Fig. 9.9) according to a linear law:

$v(z) = \frac{v_0}{d} z$    (9.10)

The liquid particles in direct contact with the plates adhere to them,
as it were, and have the same velocity as the plates themselves.
By Eq. (9.10),

$\left|\frac{dv}{dz}\right| = \frac{v_0}{d}$    (9.11)

We have used the magnitude sign for the following reason. If we had
fastened the top plate and moved the bottom one (see Fig. 9.9) or
had reversed the direction of the z-axis, the derivative dv/dz would
have become negative. The value of $v_0/d$, however, is always
positive. Hence, for Eq. (9.11) to hold in any case, we must take the
magnitude of dv/dz.

Using Eq. (9.11), we can write Eq. (9.9) as follows:

$F_{fr} = \eta \left|\frac{dv}{dz}\right| S$    (9.12)

This equation determines the magnitude of the force of friction.
The quantity |dv/dz| shows how fast the velocity changes in the
direction of the z-axis, and is the magnitude of the gradient of the
magnitude of the velocity (if v depends only on z, then
$\partial v/\partial x = \partial v/\partial y = 0$, $\partial v/\partial z = dv/dz$).

We have obtained Eq. (9.12) for the case when the velocity changes 
according to a linear law. It was found that this equation also 
holds for any other law of the change in the velocity from layer to 
layer. In this case, to determine the force of friction between two
neighbouring layers, we must take the value of |dv/dz| at the place
where the imaginary interface between the layers passes.

Everything said in this section relates to all fluids. 
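
Equations (9.9) and (9.10) can be illustrated by the following short calculation. It is a sketch only: the function names are hypothetical, and the viscosity (of the order of that of water at room temperature), the plate area, the gap, and the velocity are assumed values.

```python
def friction_force(eta: float, v0: float, d: float, area: float) -> float:
    """Magnitude of the force of internal friction, Eq. (9.9): eta*(v0/d)*S."""
    if d <= 0:
        raise ValueError("the distance between the plates must be positive")
    return eta * v0 / d * area


def layer_velocity(v0: float, d: float, z: float) -> float:
    """Velocity of the liquid layer at the height z above the bottom plate,
    Eq. (9.10): v(z) = (v0/d)*z."""
    return v0 / d * z


if __name__ == "__main__":
    eta = 1.0e-3  # Pa*s, assumed viscosity (roughly water at room temperature)
    v0 = 0.1      # m/s, assumed velocity of the top plate
    d = 1.0e-3    # m, assumed distance between the plates
    area = 0.01   # m^2, assumed area of a plate
    print(friction_force(eta, v0, d, area))  # force in newtons
    print(layer_velocity(v0, d, d / 2))      # velocity in the middle of the gap
```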

The unit of viscosity in the SI system is the viscosity at which
the gradient of the velocity with a magnitude of 1 m/s per m leads
to the appearance of a force of internal friction of 1 N per m² of
surface of contact of the layers. This unit is called the pascal-second
(Pa·s)*.

The unit of viscosity in the cgs system is the poise (P) equal to
the viscosity at which the gradient of the velocity with a magnitude
of 1 cm/s per cm leads to the appearance of a force of internal
friction of 1 dyn per cm² of surface of contact of the layers. The unit
equal to $10^{-6}$ P is called the micropoise (μP). The poise and the
pascal-second are related as follows:

1 Pa·s = 10 P

* The pascal is the name given to the unit of pressure in the SI system
(1 Pa = 1 N/m²).

The viscosity depends on the temperature. The nature of this
dependence differs appreciably for liquids and gases. The viscosity
of liquids greatly diminishes with increasing temperature. The
viscosity of gases, on the contrary, grows with increasing
temperature. The difference in the behaviour of $\eta$ with changes in the
temperature points to the difference in the mechanism of internal
friction in liquids and gases.


9.5. Laminar and Turbulent Flows 

Two kinds of flow of a liquid (or gas) are observed. In some cases, 
the liquid separates, as it were, into layers that slide relative to 
one another without mixing. Such flow is called laminar (from the 
Latin word “lamina” meaning plate or strip). If we introduce a co- 
loured stream into a laminar flow, it is retained without being 
washed out over the entire length of the flow because the liquid par- 
ticles in a laminar flow do not pass over from one layer to another. 
A laminar flow is steady. 

With an increase in the velocity or cross-sectional dimensions of 
a flow, its nature changes quite appreciably. Vigorous stirring of 
the liquid appears. Such a flow is called turbulent. In a turbulent 
flow, the velocity of the particles at each given place constantly 
changes chaotically — the flow is not steady. If we introduce a co- 
loured stream into a turbulent flow, already at a small distance 
from the place of its introduction the coloured liquid will be uni- 
formly distributed over the entire cross section of the flow. 

The British scientist Osborne Reynolds (1842-1912) established
that the nature of a flow depends on the value of the dimensionless
quantity

$\mathrm{Re} = \frac{\rho v l}{\eta}$    (9.13)

where $\rho$ = density of the liquid (or gas)

v = average flow velocity (over the cross section of the pipe)
$\eta$ = viscosity of the liquid

l = dimension characterizing the cross section, for example,
the side of the square for a square cross section, the
radius or diameter for a round section, etc.

The quantity Re is called the Reynolds number. At small values
of the Reynolds number, laminar flow is observed. Beginning from
a certain definite value of Re called the critical one, the flow
acquires a turbulent nature. If for a round pipe we take its radius r
as the characteristic dimension, then the critical value of the
Reynolds number (which in this case has the form $\mathrm{Re} = \rho v r/\eta$)
equals* approximately 1000. The Reynolds number includes the ratio of
two quantities depending on the properties of a liquid, namely the
density $\rho$ and the viscosity $\eta$. The ratio

$\nu = \frac{\eta}{\rho}$    (9.14)

is called the kinematic viscosity. In contrast to $\nu$, the quantity $\eta$
is known as the dynamic viscosity. Using the kinematic viscosity,
we can write the Reynolds number as follows:

$\mathrm{Re} = \frac{v l}{\nu}$    (9.15)

The Reynolds number can be used as a dimensionless or similarity 
number for the flow of liquids in pipes, channels, etc. The nature of 
the flow of different liquids (or gases) in pipes of different cross 
sections will be absolutely the same if the same value of Re corres- 
ponds to each flow. 
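
The Reynolds number (9.13) and the criterion given above for a round pipe can be put into a few lines of code. This is a minimal sketch: the critical value of about 1000 (with the pipe radius as the characteristic dimension) is taken from the text, whereas the function names and the properties assumed for water are illustrative.

```python
CRITICAL_RE_RADIUS = 1000.0  # approximate critical value, pipe radius taken as l


def reynolds_number(rho: float, v: float, l: float, eta: float) -> float:
    """Reynolds number, Eq. (9.13): Re = rho*v*l/eta."""
    if eta <= 0:
        raise ValueError("the viscosity must be positive")
    return rho * v * l / eta


def flow_regime(re: float, critical: float = CRITICAL_RE_RADIUS) -> str:
    """Rough classification of the flow by comparing Re with the critical value."""
    return "laminar" if re < critical else "turbulent"


if __name__ == "__main__":
    rho = 1000.0  # kg/m^3, assumed density of water
    eta = 1.0e-3  # Pa*s, assumed viscosity of water at room temperature
    radius = 0.01 # m, assumed pipe radius
    for v in (0.05, 0.5):  # m/s, assumed average flow velocities
        re = reynolds_number(rho, v, radius, eta)
        print(f"v = {v} m/s: Re = {re:.0f}, {flow_regime(re)}")
```

Since the transition does not occur at a sharply defined value of Re, such a classification should be regarded only as a rough estimate.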


9.6. Flow of a Liquid in a Round Pipe 

When a liquid flows through a round pipe, the velocity is zero at 
the pipe walls and maximum at its axis. Assuming the flow to be 
laminar, let us find the law of the change in the velocity with the 
distance r from the pipe axis. 

Let us separate an imaginary cylindrical volume of the liquid
of radius r and length l (Fig. 9.10). Upon steady flow in a pipe of
constant cross section, the velocities of all the particles of the
liquid remain unchanged. Hence, the sum of the external forces
applied to any volume of the liquid is zero. The bases of the
cylindrical volume being considered experience forces of pressure
whose sum is $(p_1 - p_2)\pi r^2$. This force acts in the direction of
motion of the liquid. In addition, the side surface of the cylinder
experiences a force of friction equal to $\eta\,|dv/dr|\,2\pi r l$ (we have
in view the value of dv/dr at the distance r from the pipe axis). The
condition for steady flow has the form

$(p_1 - p_2)\pi r^2 = \eta \left|\frac{dv}{dr}\right| 2\pi r l$    (9.16)

* It is obvious that if we take the diameter of the pipe instead of its radius
as the quantity l, we must double the critical value of Re.

The velocity diminishes with an increasing distance from the
pipe axis. Consequently, dv/dr is negative, and $|dv/dr| = -dv/dr$.
Taking this into account, we shall transform Eq. (9.16) as follows:

$\frac{dv}{dr} = -\frac{(p_1 - p_2)\,r}{2\eta l}$

Separating the variables, we get

$dv = -\frac{p_1 - p_2}{2\eta l}\, r\,dr$

Integration yields

$v = -\frac{p_1 - p_2}{4\eta l}\, r^2 + C$    (9.17)

The integration constant must be selected so that the velocity will
vanish at the pipe walls, i.e. with r = R (R is the pipe radius).
From this condition

$C = \frac{p_1 - p_2}{4\eta l} R^2$

Introduction of the value of C in Eq. (9.17) gives

$v(r) = \frac{p_1 - p_2}{4\eta l} R^2 \left(1 - \frac{r^2}{R^2}\right)$    (9.18)

The value of the velocity along the axis of the pipe is

$v_0 = v(0) = \frac{p_1 - p_2}{4\eta l} R^2$    (9.19)

By using this equation in Eq. (9.18), we can obtain

$v(r) = v_0 \left(1 - \frac{r^2}{R^2}\right)$    (9.20)

Thus, with laminar flow, the velocity changes with an increasing
distance from the axis of a pipe according to a parabolic law
(Fig. 9.11).

With turbulent flow, the velocity at each point changes chaotically.
With constant external conditions, the average (in time) velocity
at each point of the cross section of a pipe is constant. The
profile of the average velocities in turbulent flow is shown in Fig. 9.12.
The velocity changes near the walls of a pipe at a much greater
rate than in laminar flow. In the remaining part of the cross section,
the change in the velocity is smaller.

Assuming the flow to be laminar, let us calculate the rate of flow
of the liquid Q, i.e. the volume of liquid flowing through the cross
section of a pipe in unit time. Let us divide the cross section of the
pipe into rings with a width of dr (Fig. 9.13). In one second, a volume
of liquid equal to the product of the ring area $2\pi r\,dr$ and the
velocity of the flow at the points at a distance of r from the pipe
axis will pass through a ring of radius r. With a view to Eq. (9.20)
we get

$dQ = v_0 \left(1 - \frac{r^2}{R^2}\right) 2\pi r\,dr$    (9.21)

To obtain the rate of flow Q, we must integrate Eq. (9.21) with
respect to r within the limits from zero to R:

$Q = \int_0^R v_0 \left(1 - \frac{r^2}{R^2}\right) 2\pi r\,dr = \frac{1}{2}\pi R^2 v_0 = \frac{1}{2} S v_0$    (9.22)

(S is the cross-sectional area of the pipe). Inspection of Eq. (9.22)
shows that in a laminar flow the average value of the velocity (over
the cross section) is half the value of the velocity at the axis of the
pipe.

Fig. 9.13

Substituting in Eq. (9.22) the value for $v_0$ from Eq. (9.19), we
get the following formula for the rate of flow:

$Q = \frac{(p_1 - p_2)\,\pi R^4}{8\eta l}$    (9.23)


This formula is called the Poiseuille formula. According to it, 
the flow of a liquid is proportional to the pressure drop per unit pipe 
length, to the fourth power of the pipe radius, and is inversely pro- 
portional to the viscosity of the liquid. It must be remembered that 
the Poiseuille formula may be applied only for a laminar flow. 
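
The Poiseuille formula can be checked numerically by summing the contributions (9.21) of thin rings and comparing the result with Eq. (9.23). The sketch below does this for an assumed capillary; the function names and all the numerical values are hypothetical.

```python
import math


def axis_velocity(dp: float, radius: float, length: float, eta: float) -> float:
    """Velocity on the pipe axis, Eq. (9.19): v0 = (p1 - p2)*R^2/(4*eta*l)."""
    return dp * radius**2 / (4.0 * eta * length)


def poiseuille_flow_rate(dp: float, radius: float, length: float,
                         eta: float) -> float:
    """Rate of flow by Eq. (9.23): Q = (p1 - p2)*pi*R^4/(8*eta*l)."""
    return dp * math.pi * radius**4 / (8.0 * eta * length)


def flow_rate_by_rings(v0: float, radius: float, n: int = 10_000) -> float:
    """Sum of dQ = v0*(1 - r^2/R^2)*2*pi*r*dr over n thin rings, Eq. (9.21)."""
    dr = radius / n
    total = 0.0
    for i in range(n):
        r = (i + 0.5) * dr  # middle of the i-th ring
        total += v0 * (1.0 - r * r / radius**2) * 2.0 * math.pi * r * dr
    return total


if __name__ == "__main__":
    dp = 100.0       # Pa, assumed pressure drop p1 - p2
    radius = 1.0e-3  # m, assumed radius of the capillary
    length = 0.1     # m, assumed length of the capillary
    eta = 1.0e-3     # Pa*s, assumed viscosity
    v0 = axis_velocity(dp, radius, length, eta)
    q = poiseuille_flow_rate(dp, radius, length, eta)
    print(f"v0 = {v0:.3f} m/s")
    print(f"Q by Eq. (9.23) = {q:.4e} m^3/s")
    print(f"Q by rings      = {flow_rate_by_rings(v0, radius):.4e} m^3/s")
    print(f"mean velocity   = {q / (math.pi * radius**2):.3f} m/s")
```

The mean velocity obtained in this way is half the velocity on the axis, in accordance with Eq. (9.22).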

Formula (9.23) is used to determine the viscosity of liquids. By
passing a liquid through a capillary of known radius and measuring
the pressure drop and the rate of flow, we can find $\eta$.


9.7. Motion of Bodies in Fluids

Forces whose resultant will be designated by the symbol R 
(Fig. 9.14) act on a body upon its motion in a fluid*. The force R 
can be resolved into two components, of which Q is directed oppo- 
site to the motion of the body (or in the direction of the flow advanc- 
ing onto the body), and P at right angles to this direction. The 
components Q and P are called the drag (or head resistance) and the 
lift (or lifting force), respectively. It is obvious that only a drag can 
act on a body that is symmetrical relative to the direction of motion, 
while the lift in this case will vanish. 

Calculations show that the uniform motion of bodies in an ideal
fluid should occur without drag. Having no viscosity, an ideal
fluid should slide freely over the surface of a body, flowing
completely around it. Figure 9.15 shows the streamlines when an ideal
fluid flows around a very long (“infinite”) cylinder. Owing to complete
flowing around the cylinder, the pattern of the streamlines is
absolutely symmetrical both relative to the straight line passing
through points A and B and relative to the straight line passing
through points C and D. Hence, the pressure near points A and B will
be the same (and greater than in an undisturbed flow because the
velocity near these points is lower). The pressure near points C and D
will also be the same (and lower than in an undisturbed flow because
the velocity near these points is higher). Consequently, the resultant
force of the pressure on the surface of the cylinder (which in the
absence of viscosity could set up a drag) will evidently vanish. The
same result is also obtained for bodies of a different shape.


* We shall note that with a constant velocity of a body relative to a fluid 
the force acting on the body, according to Galileo’s principle of relativity, will 
be the same as when the fluid is moving with the same velocity relative to the 
stationary body. Figure 9.14 corresponds to the latter case. 
Other phenomena are encountered when a body moves in a viscous
fluid. In this case, a very thin layer of the fluid adheres to the
body’s surface and moves together with it as a single whole, carrying
along the following layers owing to friction. The velocity of the
layers diminishes with an increasing distance from the body’s surface
and, finally, at a certain distance from the surface the fluid is
virtually undisturbed by the motion of the body. The body is thus
surrounded by a layer of the fluid in which there is a velocity
gradient. This layer is called the boundary layer. Friction forces act
in it which in the long run are applied to the body and lead to the
appearance of a drag. But this is not all. The presence of a
boundary layer radically changes the nature of the flow of a fluid
around a body. Complete flow around the body becomes impossible.
The action of the friction forces in the surface layer causes
the flow to break away from the body’s surface. The result is the
appearance of eddies behind the body (see Fig. 9.16 showing the
flow of a viscous fluid around a cylinder). The eddies are carried
away by the flow and gradually attenuate owing to friction. The
energy of the eddies is spent for heating the fluid. The pressure in
the eddy region formed behind the body is lowered. Consequently,
the resultant of the pressure forces will differ from zero, this
leading in turn to a drag.

Fig. 9.16    Fig. 9.17

The drag thus consists of the friction drag and the pressure drag. 
With given cross-sectional dimensions of a body, the pressure drag 
greatly depends on its form. For this reason, it is also called the 
form drag. Bodies with a well streamlined drop-shaped form 
(Fig. 9.17) have the smallest pressure drag. Designers do everything 
possible to impart such a form to the fuselage and wings of aircraft, 
to the body of motor vehicles, etc. 

The ratio between the friction drag and the pressure drag is deter- 
mined by the value of the Reynolds number (9.13). Here, l is a char- 
acteristic dimension of the body in question (for example, the radius 
for a spherical body), and v is the velocity of the body relative to the 
fluid. 
At small values of Re, the main part is played by the friction drag,
so that the pressure drag may be disregarded. The part of the
pressure drag grows more and more with increasing Re. At great values
of Re, pressure forces predominate in the drag.

When determining the nature of the forces acting on a body in
a flow, the Reynolds number can be used as a similarity number
(scale factor) in this case too. This circumstance is taken advantage
of in modelling. For example, a model of an aeroplane will behave
in a gas flow in the same way as its full-scale counterpart if, in
addition to the geometrical similarity of the model and the aeroplane,
the Reynolds number is also the same for both.

The Stokes Formula. At small values of Re, i.e. at low
velocities [and small $l$'s, see Eq. (9.13)], the resistance of a medium
is due virtually only to the friction forces. George Stokes (1819-1903)
established that the drag force in this case is proportional to
the dynamic viscosity $\eta$, the velocity $v$ of a body relative to the
fluid, and the characteristic dimension of the body $l$, i.e. $F \propto \eta l v$
(it is assumed that the distance from the body to the boundaries of
the fluid, for example, to the walls of a vessel confining it,
considerably exceeds the dimensions of the body). The proportionality
constant depends on the form of the body. For a sphere, if we take its
radius $r$ as the dimension $l$, the proportionality constant is $6\pi$. Hence,
the drag force on a sphere in fluids at small velocities, according to
the Stokes formula, is

$$F = 6\pi\eta r v \tag{9.24}$$
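
As a rough numerical illustration of Eq. (9.24), the sketch below evaluates the Stokes drag on a small sphere and checks that the Reynolds number is indeed small. It assumes that Eq. (9.13), which lies outside this section, has the usual form Re = ρvl/η. The fluid properties and the size and speed of the sphere are illustrative values chosen here, not data from the text.

```python
import math

def stokes_drag(eta, r, v):
    """Drag force in N on a sphere, Eq. (9.24): F = 6*pi*eta*r*v."""
    return 6.0 * math.pi * eta * r * v

def reynolds_number(rho, v, l, eta):
    """Assumed form of Eq. (9.13): Re = rho*v*l/eta."""
    return rho * v * l / eta

# Illustrative values (assumptions, not taken from the text):
eta_water = 1.0e-3  # Pa*s, approximate dynamic viscosity of water
rho_water = 1.0e3   # kg/m^3, approximate density of water
radius = 1.0e-4     # m, radius of the sphere (characteristic dimension l)
speed = 1.0e-3      # m/s, velocity of the sphere relative to the fluid

re = reynolds_number(rho_water, speed, radius, eta_water)
force = stokes_drag(eta_water, radius, speed)

print(f"Re = {re:.3g}")        # should be well below 1 for Eq. (9.24) to apply
print(f"F  = {force:.3g} N")
```

If the computed Re is not small, the pressure drag can no longer be disregarded and Eq. (9.24) should not be expected to hold.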

Lift. The viscosity of a fluid is of no significance for the
appearance of a lift. Figure 9.18 shows the streamlines when an ideal
fluid flows around a half-cylinder. Since the fluid flows completely
around the body, the streamlines will be symmetrical relative to
straight line CD. The pattern will not be symmetrical, however,
relative to straight line AB. The streamlines are closer together near
point C; therefore the pressure here will be lower than near point D,
and the lift P appears. A lift appears similarly in a viscous fluid.

The force keeping an aeroplane in the air is the lift acting on its 
wings. The drag is harmful during the flight of an aeroplane. This
is why the wings of an aeroplane and its fuselage are given a well
streamlined shape. The profile of an aerofoil (wing) must also ensure
an adequate lift. The profile shown in Fig. 9.19, found by the
outstanding Russian scientist Nikolai Zhukovsky (1847-1921), is the
optimal one for an aerofoil. The works of Zhukovsky and his pupil 
S. Chaplygin laid the foundation of modern aerodynamics. V. Lenin 
called Zhukovsky the father of Russian aviation. Zhukovsky, in 
particular, derived a formula for determining the lift that is the 
basis of all aerodynamic calculations of aeroplanes. 



PART II MOLECULAR PHYSICS AND THERMODYNAMICS


CHAPTER 10 GENERAL INFORMATION


10.1. Statistical Physics and Thermodynamics

Molecular physics is a branch of physics studying the structure 
and properties of a substance on the basis of the so-called molecular- 
kinetic notions. According to these notions, any body — solid, li- 
quid, or gaseous — consists of an enormous number of exceedingly 
small separate particles — molecules. (Atoms can be considered as 
monatomic molecules.) The molecules of a substance are in disor- 
dered, chaotic motion having no predominating direction. Its in- 
tensity depends on the temperature of the substance. 

A direct proof of the existence of chaotic motion of molecules is
Brownian motion. In this phenomenon, very small particles (visible
only in a microscope) suspended in a fluid are always in a state of
continuous chaotic motion that does not depend on external causes
and is a manifestation of the internal motion of the substance. The
Brownian motion of particles is due to their chaotic collisions with
molecules.

The object of the molecular-kinetic theory is to interpret the
properties of bodies that are directly observed in experiments
(pressure, temperature, etc.) as the overall result of the action of
molecules. It uses the statistical method and is interested not in the
motion of separate molecules, but only in average quantities
characterizing the motion of an enormous collection of particles.
This explains its other name, statistical physics.

Thermodynamics also studies various properties of bodies and 
changes in the state of a substance. Unlike the molecular-kinetic 
theory, however, thermodynamics studies macroscopic properties 
of bodies and natural phenomena without being interested in their 
microscopic picture. Thermodynamics permits us to arrive at a
considerable number of conclusions on how processes go on without
taking molecules and atoms into consideration and without treating
the processes from a microscopic standpoint.

Thermodynamics is founded on several fundamental laws established
as a result of generalizing a large number of experimental
facts. Consequently, the conclusions of thermodynamics have
a very general nature.

By considering the changes in the state of a substance from different
viewpoints, thermodynamics and the molecular-kinetic theory
supplement each other, forming in essence a single whole.

Turning to the history of the development of molecular-kinetic 
notions, we must point out first of all that ideas on the atomistic 
structure of a substance were already advanced by the ancient Greeks. 
These ideas, however, were nothing more than a brilliant conjecture. 
In the 17th century, atomistics again came to the forefront, but as 
a scientific hypothesis instead of a conjecture. This hypothesis was
developed in particular detail in the works of the outstanding Russian
scientist Mikhail Lomonosov (1711-1765). He attempted to
give a single picture of all the physical and chemical phenomena
known in his time. He proceeded from the corpuscular (in modern
terminology, molecular) notion of the structure of matter.
Rejecting the theory of thermogen (a hypothetical thermal
liquid whose content in a body determines the extent of its heating)
that prevailed in his time, Lomonosov saw the “cause of heat”
in the rotation of the particles of a body. Thus, Lomonosov in essence
formulated molecular-kinetic ideas.

In the second half of the 19th century and at the beginning of 
the 20th century, atomistics became a scientific theory owing to the 
works of a number of scientists. 


10.2. Mass and Size of Molecules 

The masses of atoms and molecules are characterized by using 
quantities known as the relative atomic mass of an element (the 
atomic mass in short) and the relative molecular mass of a substance 
(the molecular mass). (These quantities were previously called the 
atomic weight and the molecular weight, respectively). 

The atomic mass ($A_r$) of a chemical element is defined as the ratio
of the mass of an atom of the element to 1/12 of the mass of the atom
$^{12}$C (this is the symbol for the carbon isotope with a mass number
of 12). The molecular mass ($M_r$) of a substance is defined as the
ratio of the mass of a molecule of the substance to 1/12 of the mass
of the atom $^{12}$C. Their definitions show that the atomic and
molecular masses are dimensionless quantities.

A unit of mass equal to 1/12 of the mass of the atom $^{12}$C is called
the atomic mass unit (amu). Let us denote the value of this unit
expressed in kilogrammes by the symbol $m_{\mathrm{un}}$. Hence, the mass of
an atom expressed in kilogrammes will be $A_r m_{\mathrm{un}}$, and the mass of
a molecule will be $M_r m_{\mathrm{un}}$.

The amount of a substance containing a number of particles
(atoms, molecules, ions, electrons, etc.) equal to the number of
atoms in 0.012 kg of the carbon isotope $^{12}$C is called a mole (the
mole is a basic unit of the SI system). Multiple and submultiple
units are also used such as the kilomole (kmol), the millimole (mmol)
and the micromole (μmol).

The number of particles contained in a mole of a substance is 
called the Avogadro constant. It was found experimentally that 
the Avogadro constant is 

$$N_A = 6.023 \times 10^{23}\ \mathrm{mol^{-1}} \tag{10.1}$$

Thus, for example, a mole of copper contains $N_A$ atoms of copper,
a mole of water contains $N_A$ molecules of water, a mole of electrons
contains $N_A$ electrons, etc.

The mass of a mole is called the molar mass M. It is evident that
M equals the product of $N_A$ and the mass of a molecule $M_r m_{\mathrm{un}}$:

$$M = N_A M_r m_{\mathrm{un}} \tag{10.2}$$

For carbon $^{12}$C, we have M = 0.012 kg/mol, and the mass of an
atom is $12 m_{\mathrm{un}}$. Substitution of these values in Eq. (10.2) yields

$$0.012\ (\mathrm{kg/mol}) = N_A\ (\mathrm{mol^{-1}}) \times 12 m_{\mathrm{un}}\ (\mathrm{kg})$$

Hence,

$$m_{\mathrm{un}}\ (\mathrm{kg}) = \frac{0.001\ (\mathrm{kg \cdot mol^{-1}})}{N_A\ (\mathrm{mol^{-1}})} = \frac{0.001}{6.023 \times 10^{23}} = 1.66 \times 10^{-27}\ \mathrm{kg} = 1.66 \times 10^{-24}\ \mathrm{g} \tag{10.3}$$

Hence, the mass of any atom is $1.66 \times 10^{-27} A_r$ kg, and the mass
of any molecule is $1.66 \times 10^{-27} M_r$ kg.

It can be seen from Eq. (10.3) that the product $N_A m_{\mathrm{un}}$ equals
0.001 kg/mol. Introducing this value in Eq. (10.2) we find that

$$M = 0.001 M_r\ \mathrm{kg/mol} \tag{10.4}$$

or

$$M = M_r\ \mathrm{g/mol} \tag{10.5}$$

Thus, the mass of a mole expressed in grammes numerically equals
the relative molecular mass. It must be borne in mind, however,
that whereas $M_r$ is a dimensionless quantity, M has a dimension
and is measured in kg/mol (or g/mol).
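
The chain of relations (10.1)-(10.4) can be checked with a few lines of arithmetic. The sketch below is only an illustration: the function names are introduced here for convenience, and the relative molecular mass of water is taken as the rounded value 18.

```python
N_A = 6.023e23  # mol^-1, the value quoted in Eq. (10.1)

# Eq. (10.3): the product N_A * m_un equals 0.001 kg/mol.
m_un = 0.001 / N_A  # atomic mass unit in kg

def particle_mass_kg(relative_mass):
    """Mass of one atom or molecule in kg: A_r * m_un or M_r * m_un."""
    return relative_mass * m_un

def molar_mass_kg_per_mol(relative_mass):
    """Eq. (10.4): M = 0.001 * M_r kg/mol."""
    return 0.001 * relative_mass

def particles_in_sample(mass_kg, relative_mass):
    """Number of molecules in a sample: (number of moles) * N_A."""
    return mass_kg / molar_mass_kg_per_mol(relative_mass) * N_A

M_r_water = 18.0  # rounded relative molecular mass of water (assumption)

print(f"m_un              = {m_un:.3e} kg")
print(f"mass of H2O       = {particle_mass_kg(M_r_water):.3e} kg")
print(f"molar mass of H2O = {molar_mass_kg_per_mol(M_r_water):.3f} kg/mol")
print(f"molecules in 1 g  = {particles_in_sample(1e-3, M_r_water):.3e}")
```

Multiplying the mass of one molecule by $N_A$ recovers the molar mass, which is simply Eq. (10.2) read in the opposite direction.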

Now let us assess the size of molecules. It is natural to assume
that molecules in a liquid are quite close to one another. We can
therefore approximately find the volume of one molecule by dividing
the volume of a mole of a liquid, for example, water, by the number
of molecules in a mole $N_A$. One mole (i.e. 18 g) of water occupies
a volume of 18 cm$^3$ = $18 \times 10^{-6}$ m$^3$. Hence, the volume falling to
one molecule is

$$\frac{18 \times 10^{-6}}{6 \times 10^{23}} = 30 \times 10^{-30}\ \mathrm{m^3}$$

It follows that the linear dimensions of water molecules are
approximately

$$\sqrt[3]{30 \times 10^{-30}}\ \mathrm{m} \approx 3 \times 10^{-10}\ \mathrm{m} = 3\ \text{Å}$$

The molecules of other substances also have dimensions of the
order of a few angstroms. (The angstrom, Å, is a non-system unit
of length equal to $10^{-10}$ m. It is very convenient in atomic physics.)
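
The estimate above is easy to reproduce numerically. The following sketch repeats it with the value of $N_A$ from Eq. (10.1) instead of the rounded $6 \times 10^{23}$; the variable names are introduced here only for illustration.

```python
N_A = 6.023e23               # mol^-1, Eq. (10.1)
molar_volume_water = 18e-6   # m^3 occupied by one mole (18 g) of water

# Volume falling to one molecule, assuming the molecules are closely packed.
volume_per_molecule = molar_volume_water / N_A   # m^3

# Linear dimension estimated as the edge of a cube of that volume.
linear_size = volume_per_molecule ** (1.0 / 3.0)  # m
linear_size_angstrom = linear_size / 1e-10        # 1 angstrom = 1e-10 m

print(f"volume per molecule ~ {volume_per_molecule:.2e} m^3")
print(f"linear size         ~ {linear_size_angstrom:.1f} angstrom")
```

This is an order-of-magnitude estimate only: treating each molecule as a small cube says nothing about its actual shape.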


10.3. State of a System. Process 

We shall call a combination of bodies being considered a system
of bodies or simply a system. An example of a system is a liquid
and the vapour in equilibrium with it. In particular, a system may
consist of one body.

Any system can be in different states distinguished by their
temperature, pressure, volume, etc. Such quantities characterizing
the state of a system are called parameters of state.

A parameter does not always have a definite value. If, for exam- 
ple, the temperature at different points of a body is not the same, 
then a definite value of the parameter T cannot be ascribed to the 
body. In this case, the body is said to be in a non-equilibrium state. 
If such a body is isolated from other bodies and left alone, then its 
temperature will level out and take on the same value T for all 
points — the body will pass over into an equilibrium state. This 
value of T will not change until the body is brought out of its equi- 
librium state by external action. 

The same may also occur with other parameters, for instance, 
with the pressure p. If we take a gas confined in a cylindrical vessel 
closed with a tightly fitted piston and begin to rapidly move the 
latter in, then a gas cushion will be formed under it in which the 
pressure will be greater than in the remaining volume of the gas. 
Consequently, the gas in this case cannot be characterized by a 
definite value of the pressure p, and its state will be a non-equilibrium 
one. If we stop the movement of the piston, however, then the pres- 
sure at different points of the volume will level out, and the gas will 
pass over to an equilibrium state. 

The process of transition of a system from a non-equilibrium
state to an equilibrium one is called a relaxation process, or simply
relaxation. The time needed for such a transition is called the
relaxation time. The relaxation time is defined as the time
in which the initial deviation of a quantity from its equilibrium
value diminishes by a factor of e, where e is the base of natural
logarithms. Each parameter of a system has its own relaxation time.
The greatest of these times plays the part of the relaxation time of
the system.
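
The definition of the relaxation time can be illustrated with a simple model. The sketch below assumes that the deviation from equilibrium decays exponentially; this is a common model, but it is an assumption made here and not something stated in the text. The parameter names and the numerical values are hypothetical.

```python
import math

def deviation(t, initial_deviation, tau):
    """Deviation from the equilibrium value at time t, assuming exponential
    decay with relaxation time tau (a model assumption)."""
    return initial_deviation * math.exp(-t / tau)

def system_relaxation_time(relaxation_times):
    """The greatest of the relaxation times of the separate parameters
    plays the part of the relaxation time of the system."""
    return max(relaxation_times.values())

# After one relaxation time the deviation has diminished by a factor of e.
d0, tau = 10.0, 2.0
print(d0 / deviation(tau, d0, tau))   # ratio equal to e

# Hypothetical relaxation times (in seconds) of two parameters of a system.
taus = {"pressure": 0.01, "temperature": 5.0}
print(system_relaxation_time(taus))
```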

Thus, by an equilibrium state of a system is meant a state in 
which all the parameters of the system have definite values remain- 
ing constant as long as is desired in unchanging external conditions*. 

If we plot the values of two parameters along coordinate axes,
then any equilibrium state of a system can be depicted by a point
on the coordinate plane (see, for example, point 1 in Fig. 10.1).
A non-equilibrium state cannot be depicted in this way because
at least one of the parameters will not have a definite value in
this state.

A process, i.e. a transition of a system from one state to another, is
associated with violation of the equilibrium of the system. Therefore,
when a process occurs in a system, it passes through a sequence of
non-equilibrium states. Reverting to the process of compressing a gas
in a vessel closed with a piston that we have considered, we can
conclude that the faster the gas is compressed, the greater is the
violation of equilibrium when the piston is moved in. If we move the
piston in very slowly, equilibrium will be violated insignificantly,
and the pressure at different points will differ only slightly from a
certain average value p. In the limit, if the gas is compressed
infinitely slowly, it will be characterized at each moment by a
definite value of the pressure. Consequently, the state of the gas at
each moment in this case is an equilibrium one, and the infinitely
slow process will consist of a sequence of equilibrium states.

A process consisting of a continuous sequence of equilibrium 
states is called an equilibrium or a quasistatic one. It follows from 
what has been said above that only an infinitely slow process can 
be an equilibrium one. Real processes, when they occur sufficiently 
slowly, can approach an equilibrium one as close as desired. 

An equilibrium process can be conducted in the reverse direction. 
The system will pass through the same states as in the forward proc- 
ess, but in the opposite sequence. This is why equilibrium processes 
are also called reversible. 


* When a gas is in an external force field (for example, in the field of the
force of gravity), its equilibrium state will set in at a pressure changing
regularly from point to point (see Sec. 10.14).

A reversible (i.e. equilibrium) process can be depicted on a
coordinate plane by the relevant curve (see Fig. 10.1). We shall
conventionally depict irreversible (i.e. non-equilibrium) processes by
dashed curves.

A process in which a system after a number of changes returns 
to its initial state is called a cyclic process or a cycle. The latter is 
depicted graphically by a closed curve. 

The concepts of an equilibrium state and a reversible process 
play a great part in thermodynamics. All the quantitative conclu- 
sions of thermodynamics are strictly applicable only to equilibrium 
states and reversible processes. 


10.4. Internal Energy of a System 

The internal energy of a body is defined as the energy of this 
body less the kinetic energy of the body as a whole and the potential 
energy of the body in the external force field. For example, in
determining the internal energy of a mass of gas, we must not take into
consideration the energy of motion of the gas together with the ves- 
sel containing it, and the energy due to the gas being in the field 
of the Earth’s gravitational forces. 

Hence, the concept of internal energy includes the kinetic energy 
of the chaotic motion of molecules, the potential energy of interaction 
between the molecules, and the intramolecular energy*. 

The internal energy of a system of bodies equals the sum of the 
internal energies of each of them separately and the energy of inter- 
action between the bodies. The latter is the energy of intermolecular 
interaction in a thin layer on the interface between the bodies. This 
energy is so small in comparison with the energy of macroscopic 
bodies that it may be disregarded, and we may consider the inter- 
nal energy of a system of macroscopic bodies as the sum of the inter- 
nal energies of the bodies forming the system. The internal energy 
is thus an additive quantity. 

The internal energy is a function of state of a system. This sig- 
nifies that whenever a system is in a given state, its internal energy 
takes on the value characterizing this state regardless of the previous 
history of the system. Hence, the change in the internal energy when 
a system passes from one state to another will always equal the dif- 
ference between the values of the internal energy in these states re- 
gardless of the path followed by the transition. In other words, the 
change in the internal energy does not depend on the process or 
processes that caused the system to pass from one state to another. 

* This definition should be treated as a preliminary one. In statistical phy- 
sics, the concept of internal energy is defined more precisely. A discussion of 
this more precise definition is beyond the scope of a general course in physics.


10.5. The First Law of Thermodynamics

The internal energy can change mainly through two
different processes: the performance of the work A' on a body and
the imparting of the heat Q to it. The doing of work is attended by
the displacement of the external bodies acting on the system. For
example, when we move in the piston closing a vessel with a gas,
the piston does the work A' on the gas. According
to Newton’s third law, the gas, in turn, does the work A = −A'
on the piston.

The imparting of heat to a gas is not associated with the motion 
of external bodies and is therefore not associated with the doing 
of macroscopic (i.e. relating to the entire complex of molecules 
which the body consists of) work on the gas. In this case, the change 
in the internal energy is due to the fact that separate molecules of 
the hotter body do work on separate molecules of the colder one. 
Energy is also transferred here by radiation. The combination of 
microscopic (i.e. involving not an entire body, but separate mole- 
cules of it) processes is called heat transfer. 

Just as the amount of energy transferred by one body to another 
is determined by the work A done by the bodies on each other, the 
amount of energy transmitted from one body to another by heat 
transfer is determined by the amount of heat Q transferred by one 
body to the other. Thus, the increment of the internal energy of 
a system must equal the sum of the work A' done on the system and 
the amount of heat Q imparted to it: 

$$U_2 - U_1 = Q + A' \tag{10.6}$$

Here $U_1$ and $U_2$ are the initial and final values of the internal energy
of the system. It is customary practice to consider the work A (equal
to −A') done by a system on external bodies instead of the work A'
done by external bodies on the system. Introducing −A in Eq. (10.6)
instead of A' and solving it relative to Q, we have

$$Q = U_2 - U_1 + A \tag{10.7}$$

Equation (10.7) expresses the law of energy conservation and 
forms the content of the first law of thermodynamics. It can be put 
in words as follows: the amount of heat imparted to a system is spent 
on an increment of the internal energy of the system and on the work 
done by the system on external bodies.

What has been said above does not at all signify that the inter- 
nal energy of a system always grows when heat is imparted to it. 
It may happen that notwithstanding the transfer of heat to a system, 
its energy diminishes instead of growing ($U_2 < U_1$). In this case
according to Eq. (10.7), we have A > Q, i.e. the system does work
both at the expense of the heat Q it has received and at the expense
of its store of internal energy, whose decrement is $U_1 - U_2$. It must
also be borne in mind that the quantities Q and A in Eq. (10.7) are
algebraic ones (Q < 0 signifies that the system actually gives up
heat instead of receiving it).
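
The sign conventions of Eq. (10.7) can be made concrete with a short sketch. The function names and the numerical values (in joules) are illustrative assumptions; only the relation Q = U_2 − U_1 + A itself comes from the text.

```python
def heat_received(U1, U2, A):
    """Eq. (10.7): heat imparted to the system, Q = U2 - U1 + A."""
    return (U2 - U1) + A

def internal_energy_increment(Q, A):
    """Eq. (10.7) solved for U2 - U1; Q and A are algebraic quantities."""
    return Q - A

# The system receives 100 J of heat but does 150 J of work (A > Q):
# its internal energy diminishes by 50 J.
print(internal_energy_increment(Q=100.0, A=150.0))   # -50.0

# Q < 0: the system gives up 20 J of heat and does no work.
print(internal_energy_increment(Q=-20.0, A=0.0))     # -20.0

# The same bookkeeping read the other way round.
print(heat_received(U1=500.0, U2=450.0, A=150.0))    # 100.0
```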

Examination of Eq. (10.7) shows that the amount of heat Q can 
be measured in the same units as work or energy. In the SI system, 
the unit of the amount of heat is the joule. 

A special unit called the calorie is also used to measure the amount 
of heat. One calorie equals the amount of heat needed to raise 1 g 
of water from 19.5 to 20.5 °C*. One kilocalorie equals 1000 calories. 

It was established experimentally that one calorie is equivalent 
to 4.18 J. Hence, one joule is equivalent to 0.24 cal. The quantity 
E = 4.18 J/cal is called the mechanical equivalent of heat.

If the quantities in Eq. (10.7) are expressed in different units, 
then some of them must be multiplied by the appropriate equiva- 
lent. For example, if we express Q in calories and U and A in joules, 
Eq. (10.7) must be written in the form 

$$EQ = U_2 - U_1 + A$$


We shall always assume in the following that Q, A and U are
expressed in the same units, and write the equation of the first 
law of thermodynamics in the form of Eq. (10.7). 
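
When the quantities are given in mixed units, the conversion with the mechanical equivalent E can be sketched as follows. The helper names are hypothetical; the value 4.18 J/cal is the one quoted in the text.

```python
E = 4.18  # J/cal, mechanical equivalent of heat quoted in the text

def cal_to_joule(q_cal):
    """Convert an amount of heat from calories to joules."""
    return E * q_cal

def joule_to_cal(q_joule):
    """Convert an amount of heat from joules to calories."""
    return q_joule / E

def heat_in_calories(U1, U2, A):
    """Solve E*Q = U2 - U1 + A for Q, with U and A in joules, Q in calories."""
    return (U2 - U1 + A) / E

print(cal_to_joule(1000.0))                 # one kilocalorie in joules
print(round(joule_to_cal(1.0), 2))          # one joule in calories
print(heat_in_calories(0.0, 418.0, 0.0))    # heat needed for a 418 J increment
```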

In calculating the work done by a system or the heat received by 
it, we usually have to divide the process being considered into a num- 
ber of elementary ones, each of which corresponds to a very small 
(infinitely small in the limit) change in the parameters of the system. 
Equation (10.7) has the following form for an elementary process: 

$$\Delta' Q = \Delta U + \Delta' A \tag{10.8}$$

where $\Delta' Q$ = elementary amount of heat
$\Delta' A$ = elementary work
$\Delta U$ = increment of the internal energy of the system in the
course of the given elementary process.

It is very important to bear in mind that $\Delta' Q$ and $\Delta' A$ must never
be considered as increments of the quantities Q and A. The change $\Delta f$
in a quantity f corresponding to an elementary process may be
considered as the increment of this quantity only if $\sum \Delta f$
corresponding to the transition from one state to another does not
depend on the path along which the transition occurs, i.e. if the
quantity f is a function of state. With respect to a function of state,
we can speak of its “store” in each state. For example, we can speak of
the store of internal energy that a system has in different states.

* The calorie defined in this way is the 20-degree calorie. Also used are the
15-degree calorie and the mean calorie, which is 1/100 of the heat needed to
raise 1 g of water from 0 to 100 °C.

We shall see in the following that the quantity of work done by 
a system and the amount of heat it receives depend on the path 
followed by the system in its transition from one state to another. 
Hence, neither Q nor A is a function of state, and for this reason
we cannot speak of the store of heat or work that a system has in 
different states. 

Thus, the symbol $\Delta$ before A and Q is given a different meaning
than that before U. To stress this circumstance, the $\Delta$ is primed in
the former case. The symbol $\Delta U$ signifies an increment of the
internal energy, whereas the symbols $\Delta' Q$ and $\Delta' A$ signify not an
increment, but an elementary amount of heat and work.

To perform calculations, we pass over to differentials in Eq. (10.8). 
The equation of the first law thus acquires the following form*: 

d'Q = dU + d'A (10.9) 

Integration of Eq. (10.9) over the entire process results in the expres- 
sion 

$$Q = (U_2 - U_1) + A$$

that is identical with Eq. (10.7). 

We stress again that, for example, the result of integration of 
d'A must not be written in the form 

$$\int_1^2 d'A = A_2 - A_1$$

This form would mean that the work done by a system equals the 
difference between the values (i.e. the stores) of the work in the 
second and first states. 

10.6. Work Done by a Body upon Changes in Volume

The interaction of a given body with bodies in contact with it 
can be characterized by the pressure which it exerts on them. We 
can use pressure to describe the interaction of a gas with the walls 
of a vessel, and also of a solid or a liquid body with the medium 
(for example, a gas) surrounding it. The displacement of the points 
of application of the interaction forces is attended by a change in 
the volume of a body. Hence, the work done by a given body on
external bodies can be expressed through the pressure and changes
in the body’s volume. Let us consider the following example to
find this expression.

* In Eq. (10.9), dU is a total differential, while d'Q and d'A are not total
differentials (see Sec. 3.4).

Assume that a gas is confined in a cylindrical vessel closed with 
a tightly fitting easily sliding piston (Fig. 10.2). If for some reason 
or other the gas begins to expand, it will move the piston and do work 
on it. The elementary work done by the gas in moving the piston 
through the distance $\Delta h$ is

$$\Delta' A = F \Delta h$$

where F is the force with which the gas acts on the piston.
Substituting for this force the product of the gas pressure p and the
piston area S, we have

$$\Delta' A = pS \Delta h$$

But $S \Delta h$ is the increment of the volume of the gas $\Delta V$. Hence,
the expression for the elementary work can be written as follows:

$$\Delta' A = p \Delta V \tag{10.10}$$

The quantity $\Delta' A$ in Eq. (10.10) is obviously an algebraic one.
Indeed, in compression of the gas, the directions of the displacement
$\Delta h$ and the force F with which the gas acts on the piston are
opposite. Consequently, the elementary work $\Delta' A$ will be negative. The
increment of the volume $\Delta V$ in this case will also be negative. Thus,
Eq. (10.10) gives a correct expression for the work upon any changes
in the volume of the gas.

If the pressure of the gas remains constant (for this to occur we
must simultaneously change the temperature in the appropriate
direction), the work done when the volume changes from $V_1$ to $V_2$
will be

$$A_{12} = p(V_2 - V_1) \tag{10.11}$$

If a change in the volume is attended by a change in the pressure,
then Eq. (10.10) holds only for sufficiently small $\Delta V$'s. In this case,
the work done upon finite changes in the volume must be computed
as the sum of elementary amounts of work expressed by Eq. (10.10),
i.e. by integration:

$$A_{12} = \int_{V_1}^{V_2} p\,dV \tag{10.12}$$
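
The passage from Eq. (10.10) to Eq. (10.12) can be imitated numerically by summing elementary amounts of work over many small volume steps. The sketch below is a minimal illustration: the pressure law p = c/V used in the second example is a hypothetical choice made here only to have a pressure that changes with volume, and the numerical values are arbitrary.

```python
import math

def work_by_summation(pressure, V1, V2, steps=10_000):
    """Approximate A_12 = integral of p dV from V1 to V2, Eq. (10.12),
    as a sum of elementary works p*dV, Eq. (10.10), with the pressure
    taken at the midpoint of each small volume step."""
    dV = (V2 - V1) / steps
    total = 0.0
    for i in range(steps):
        V_mid = V1 + (i + 0.5) * dV
        total += pressure(V_mid) * dV
    return total

# Constant pressure: the sum reproduces Eq. (10.11), A_12 = p*(V2 - V1).
p_const = 1.0e5  # Pa
print(work_by_summation(lambda V: p_const, 1.0e-3, 2.0e-3))   # about 100 J

# Compression (V2 < V1) gives negative work, as noted for Eq. (10.10).
print(work_by_summation(lambda V: p_const, 2.0e-3, 1.0e-3))   # about -100 J

# Hypothetical pressure law p = c/V; the exact integral is c*ln(V2/V1).
c = 100.0  # Pa*m^3
print(work_by_summation(lambda V: c / V, 1.0e-3, 2.0e-3))
print(c * math.log(2.0))
```

The last two printed numbers should nearly coincide, which shows that the sum of elementary works approaches the integral as the steps become small.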

The expressions found for the work hold for any changes in the
volume of solid, liquid, and gaseous bodies. Let us consider another
example to convince ourselves that this is true. Let us take a solid
body of an arbitrary shape immersed in a liquid or gaseous medium
that exerts on the body the pressure p identical at all points
(Fig. 10.3). Assume that the body expands so that separate elementary
portions of its surface $\Delta S_i$ receive different displacements $\Delta h_i$.
Hence, the i-th portion does the work $\Delta' A_i$ equal to $p \Delta S_i \Delta h_i$.
The work done by the body can be found as the sum of the amounts of
work done by separate portions:

$$\Delta' A = \sum \Delta' A_i = \sum p \Delta S_i \Delta h_i$$

Factoring out of the sum the value of p which is identical for all the
portions and noting that $\sum \Delta S_i \Delta h_i$ gives the increment of the
body’s volume $\Delta V$, we can write that $\Delta' A = p \Delta V$, i.e. in the
general case too we arrive at Eq. (10.10).

Let us depict the process of the change in the volume of the body in
a p-V diagram (Fig. 10.4). The area of the narrow hatched strip
in the diagram corresponds to the elementary work $\Delta' A_i = p_i \Delta V_i$.
It is obvious that the area confined between the V-axis, the curve
p = f(V), and the perpendiculars erected from points $V_1$ and $V_2$
numerically equals the work done when the volume changes from
$V_1$ to $V_2$. The work done in a cyclic process numerically equals the
area enclosed by the curve (Fig. 10.5). Indeed, the work on path 1-2
is positive and numerically equals the area hatched diagonally to
the right (we are considering a clockwise cycle). The work on path 2-1
is negative and numerically equals the area hatched diagonally to
the left. Hence, the work during a cycle numerically equals the
area enclosed by the curve. It will be positive in the direct cycle
(i.e. in one conducted in the clockwise direction), and negative
in the reverse cycle.
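
The statement about the work in a cycle can be checked on the simplest closed curve, a rectangle in the p-V plane. The sketch below assumes that the cycle is made of straight segments between the listed states, for which the work on each segment is the mean pressure times the change in volume; the numerical values are arbitrary.

```python
def cycle_work(states):
    """Work done in a closed cycle given as a list of (V, p) states joined
    by straight segments in the p-V plane. On each segment the work is the
    mean pressure times the change in volume (exact for a straight line)."""
    total = 0.0
    n = len(states)
    for i in range(n):
        V_a, p_a = states[i]
        V_b, p_b = states[(i + 1) % n]   # the last state joins the first
        total += 0.5 * (p_a + p_b) * (V_b - V_a)
    return total

# Rectangular cycle: expansion at high pressure, compression at low pressure.
# With V along the horizontal axis this order is clockwise (a direct cycle).
direct = [(1.0e-3, 2.0e5), (3.0e-3, 2.0e5), (3.0e-3, 1.0e5), (1.0e-3, 1.0e5)]
reverse = list(reversed(direct))

print(cycle_work(direct))    # positive, equal to the enclosed area (about 200 J)
print(cycle_work(reverse))   # the same magnitude with the opposite sign
```

The two isochoric segments contribute nothing, so the result is simply the difference between the work of expansion and the work of compression.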

It is clear from what has been said in Sec. 10.3 that the equations 
we have obtained may be applied only to reversible processes. 

We must note that by using Eq. (10.10) (with a transition to differ- 
entials), Eq. (10.9) expressing the first law of thermodynamics can 
be written as follows: 


$$d'Q = dU + p\,dV \tag{10.13}$$


10.7. Temperature 

We can arrive at a definition of the concept of temperature on the 
basis of the following reasoning. If contacting bodies are in a state 
of thermal equilibrium, i.e. do not exchange energy by heat trans- 
fer, they are said to have the same temperature. If when thermal 
contact is established between bodies one of them transmits energy 
to the other by heat transfer, then the first body is said to have a 
higher temperature than the second one. Many properties of bodies 
such as their volume and electrical resistance depend on the tem- 
perature. Any of these properties can be used for a quantitative 
definition of temperature. 

Let us bring the body we have chosen for measuring the tempera- 
ture (a thermometric body) into thermal equilibrium with melting 
ice. We shall assume that the body has a temperature of 0 degrees 
and shall characterize quantitatively the property of the body (the 
temperature feature) which we intend to use for measuring the
temperature. Let this feature be the volume of the body. Its value at 0
degrees is $V_0$. Next we shall bring the same body into thermal
equilibrium with water boiling under atmospheric pressure. Now we
shall assume that the body in this state has a temperature of 100
degrees, and shall determine the corresponding volume $V_{100}$. Presuming
that the temperature feature we have chosen (the volume in the
given example) changes linearly with the temperature, we can ascribe
the following temperature to the state in which our thermometric
body has the volume V:

$$t = \frac{V - V_0}{V_{100} - V_0} \times 100\ \text{degrees} \tag{10.14}$$

The temperature scale established in this way is called, as is known, 
the Celsius scale. An expression similar to Eq. (10.14) can also be 
written for the case when we use another temperature feature instead 
of the volume to measure the temperature. 
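
The graduation procedure of Eq. (10.14) amounts to a linear interpolation between the two reference points. A minimal sketch is given below; the function name is introduced here for illustration, and the resistance values in the example are hypothetical readings, not data for any real thermometer.

```python
def empirical_celsius(x, x_0, x_100):
    """Eq. (10.14) for an arbitrary temperature feature x (volume,
    resistance, ...): x_0 is its value in melting ice and x_100 its value
    in water boiling under atmospheric pressure."""
    return (x - x_0) / (x_100 - x_0) * 100.0

# Hypothetical resistance thermometer: 100.0 ohm in melting ice,
# 138.5 ohm in boiling water, 119.25 ohm in the state being measured.
print(empirical_celsius(119.25, 100.0, 138.5))   # 50.0

# By construction the two reference points give 0 and 100 degrees.
print(empirical_celsius(100.0, 100.0, 138.5))    # 0.0
print(empirical_celsius(138.5, 100.0, 138.5))    # 100.0
```

Two thermometers built on different features would agree at 0 and 100 degrees by construction, but nothing in this formula guarantees agreement in between.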

After graduating a thermometer in this way, we can use it to 
measure the temperature by bringing it into thermal equilibrium
with the body whose temperature we are interested in, and determining
the value of the volume.

When we compare thermometers functioning with different ther- 
mometric bodies (for example, mercury and alcohol) or different 
temperature features (for example, volume and electrical resistance), 
we find that the readings of these thermometers, which coincide at 
0 and 100 degrees owing to their being graduated at these tempera- 
tures, do not coincide at other temperatures. It thus follows that for 
the unique definition of a temperature scale, in addition to the 
way of graduation, we must also arrive at an agreement on the choice 
of the thermometric body and the temperature feature. How this 
choice is made in establishing the so-called empirical temperature 
scale will be treated in the following section. Getting ahead, we shall 
indicate that the second law of thermodynamics can serve as the 
basis of a temperature scale not depending on the properties of the 
thermometric body (see Sec. 12.3). This scale is called the 
thermodynamic temperature scale. 

The international practical temperature scale of 1968, formerly
called the Celsius (centigrade) scale, is used in engineering and for
everyday purposes. Physicists find the absolute scale more con- 
venient. The temperature T measured according to this scale is related 
to the temperature t according to the Celsius scale by the equation 

T = t + 273.15 

The unit of absolute temperature is the kelvin (K). It was previously
called the degree Kelvin (°K). The international practical temperature
is measured in degrees Celsius (°C). The sizes of the kelvin
and the degree Celsius are the same. A temperature of 0 K is
referred to as absolute zero, and t = −273.15 °C corresponds to it.
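
The relation T = t + 273.15 between the two scales can be written as a pair of small helper functions. The names are chosen here for illustration; the guard against temperatures below absolute zero is an addition of this sketch.

```python
ZERO_CELSIUS_IN_KELVIN = 273.15

def celsius_to_kelvin(t):
    """T = t + 273.15, with t in degrees Celsius and T in kelvins."""
    T = t + ZERO_CELSIUS_IN_KELVIN
    if T < 0:
        raise ValueError("temperature below absolute zero")
    return T

def kelvin_to_celsius(T):
    """t = T - 273.15; the sizes of the kelvin and the degree Celsius
    are the same, so only the zero point is shifted."""
    if T < 0:
        raise ValueError("temperature below absolute zero")
    return T - ZERO_CELSIUS_IN_KELVIN

print(celsius_to_kelvin(0.0))     # melting ice on the absolute scale
print(kelvin_to_celsius(0.0))     # absolute zero on the Celsius scale
```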

In the following (see Sec. 11.5), we shall show that the absolute 
temperature is proportional to the mean kinetic energy of transla- 
tional motion of the molecules of a substance. This is the physical 
meaning of absolute temperature. 


10.8. Equation of State of an Ideal Gas 

The state of a given mass of a gas is determined by the values of 
three parameters: the pressure p, volume V, and temperature T.
These parameters are related to one another according to a definite
law so that a change in one of them causes a change in the others.
This relation can be given analytically in the form of the function

F(p, V, T) = 0 (10.15)

An expression determining the relation between the parameters 
of a body is called an equation of state of the body. Hence, Eq. (10.15) 
is an equation of state of a given mass of a gas. 

A gas whose molecules interact only negligibly with one another
has the simplest properties. Such a gas is called ideal (or
perfect). The interaction between the molecules of any gas becomes
negligibly small at a great rarefaction, i.e. at low densities of the 
gas. A real gas upon sufficient rarefaction is close in its properties 
to an ideal one. Some gases such as air, nitrogen, and oxygen differ 
only slightly from an ideal gas even in usual conditions, i.e. at 
room temperature and atmospheric pressure. Helium and hydrogen 
are especially close to an ideal gas in their properties. 

Gases at low densities obey the following equation with a good 
accuracy: 

$\frac{pV}{T} = \text{const}$ (10.16)

Consequently, this equation is an equation of state of an ideal gas. 

According to the law established by Amedeo Avogadro (1776-1856),
a mole of any gas occupies the same volume in identical conditions
(i.e. at the same temperature and pressure). In particular,
in the so-called standard conditions, i.e. at 0 °C and a pressure of
1 atm ($1.01 \times 10^5$ Pa), the volume of a mole of any gas is
22.4 dm$^3$/mol = $22.4 \times 10^{-3}$ m$^3$/mol. It thus follows that when the amount
of a gas is one mole, the value of the constant in Eq. (10.16) will
be the same for all gases. Denoting the value of this constant corresponding
to one mole by the symbol R, we can write Eq. (10.16)
as follows:

$pV_m = RT$ (10.17)

We have used the subscript “m” with V to show that we are dealing
with the volume occupied by one mole of a gas at the given p and T.
Equation (10.17) is an equation of state of an ideal gas written for 
one mole. 

The quantity R is called the molar gas constant. According to 
Eq. (10.17) and Avogadro’s law, 

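$R = \frac{pV_m}{T} = \frac{1.01 \times 10^5 \times 22.4 \times 10^{-3}\ \text{Pa}\cdot\text{m}^3/\text{mol}}{273\ \text{K}} \approx 8.31\ \frac{\text{J}}{\text{mol}\cdot\text{K}}$

For practical calculations, it is sometimes convenient to use R
expressed in litre-atmospheres per mole-kelvin:

$R = \frac{1\ \text{atm} \times 22.4\ \text{l/mol}}{273\ \text{K}} = 0.0820\ \frac{\text{l}\cdot\text{atm}}{\text{mol}\cdot\text{K}}$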

It is a simple matter to pass over from Eq. (10.17) for one mole
to an equation for any mass m, taking into account that at the same
pressure and temperature ν moles of a gas will occupy a volume ν
times greater than that occupied by one mole: $V = \nu V_m$. Multiplying
Eq. (10.17) by $\nu = m/M$ (here m is the mass of the gas and M
the mass of a mole) and introducing V instead of $\nu V_m$, we get

$pV = \frac{m}{M} RT$ (10.18)


This equation is an equation of state of an ideal gas written for the 
mass m of a gas. 
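
As an illustration, Eq. (10.18) can be evaluated numerically. The sketch below is only a check of the arithmetic: the function name is hypothetical, R is taken as 8.31 J/(mol·K), and the molar mass of nitrogen is assumed to be 0.028 kg/mol.

```python
# Equation of state for a mass m of an ideal gas, Eq. (10.18): p V = (m / M) R T
R = 8.31  # molar gas constant, J/(mol*K), the value used in the text


def ideal_gas_pressure(m, M, V, T):
    """Pressure (Pa) of a mass m (kg) of gas with molar mass M (kg/mol)
    occupying a volume V (m^3) at absolute temperature T (K)."""
    return (m / M) * R * T / V


# Assumed example: 28 g of nitrogen (M assumed to be 0.028 kg/mol), i.e. one mole,
# occupying 22.4 litres at 273 K. The result should be close to 1 atm.
p = ideal_gas_pressure(m=0.028, M=0.028, V=22.4e-3, T=273.0)
print(f"p = {p:.4g} Pa")
```

With these inputs the pressure comes out close to the standard value of about $1.01 \times 10^5$ Pa quoted above, as Avogadro's law requires.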

Equation (10.18) can be given a different form. For this purpose, 
we shall introduce the quantity 




$k = \frac{R}{N_A}$ (10.19)


(R is the molar gas constant, and $N_A$ is the Avogadro constant).
This quantity is known as the Boltzmann constant. It has a deeper
physical meaning than the constant R. We shall show in Sec. 11.5
that k is the constant of proportionality between the mean energy
of thermal motion of a molecule and the absolute temperature.
Substitution of the numerical values for R and $N_A$ in Eq. (10.19)
yields


$k = \frac{8.31\ \text{J/(mol}\cdot\text{K)}}{6.023 \times 10^{23}\ \text{mol}^{-1}} = 1.38 \times 10^{-23}\ \text{J/K}$


Let us multiply and divide the right-hand side of Eq. (10.18) by
$N_A$, taking into account that $m/M = \nu$ and $R/N_A = k$. The equation
can then be written in the form

$pV = \nu N_A kT$


The product $\nu N_A$ equals the number of molecules N contained in
the mass m of a gas. Taking this into consideration, we find that 

pV = NkT (10.20) 

Now let us divide both sides of Eq. (10.20) by V. Since N/V = n 
is the number of molecules in a unit volume, we arrive at the equa- 
tion 

p = nkT (10.21) 


Equations (10.18), (10.20), and (10.21) are different forms of
writing the equation of state for an ideal gas. 
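
A short numerical sketch may help to connect the molar and molecular forms. It uses the rounded constants quoted in the text; the function name is hypothetical, and the standard conditions are taken as 273 K and $1.01 \times 10^5$ Pa.

```python
# Molecular form of the equation of state, Eq. (10.21): p = n k T
k = 1.38e-23     # Boltzmann constant, J/K (rounded value from the text)
N_A = 6.023e23   # Avogadro constant, 1/mol (value used in the text)
R = 8.31         # molar gas constant, J/(mol*K)


def number_density(p, T):
    """Number of molecules per cubic metre of an ideal gas at pressure p (Pa)
    and absolute temperature T (K)."""
    return p / (k * T)


# Number density at the standard conditions assumed above.
n0 = number_density(1.01e5, 273.0)
print(f"n = {n0:.3g} molecules per m^3")

# Consistency of Eq. (10.19): k * N_A should reproduce R to rounding accuracy.
print(f"k * N_A = {k * N_A:.3f} J/(mol*K)")
```

The number density obtained is of the order of $10^{25}$ m$^{-3}$, which shows how many molecules even a rarefied gas contains.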

The ratio of the mass of a gas to the volume it occupies gives the 
density of the gas: $\rho = m/V$. According to Eq. (10.18), the density
of an ideal gas is determined by the expression

$\rho = \frac{Mp}{RT}$ (10.22)


Thus, the density of an ideal gas is proportional to the pressure 
and inversely proportional to the temperature. 

The simple relation between the temperature and the remaining 
parameters of an ideal gas makes it tempting to use it as a thermometric
substance. Ensuring a constant volume and using the pressure
of the gas as the temperature feature, we can obtain a thermometer
with an ideally linear temperature scale. In the following, we
shall call this scale the ideal gas temperature scale. 

In practice, according to an international agreement, hydrogen 
is taken as the thermometric body. The scale established for hyd- 
rogen with the use of Eq. (10.18) is called the empirical tempera- 
ture scale. 


10.9. Internal Energy and Heat Capacity 
of an Ideal Gas 

Experiments show that the internal energy of an ideal gas depends 
only on the temperature: 

U = BT (10.23) 

Here B is a coefficient of proportionality that remains constant 
within quite a broad range of temperatures. 

The fact that the internal energy does not depend on the volume occupied
by a gas indicates that the molecules of an ideal gas do not interact
with one another for the overwhelming part of the time. Indeed, if the
molecules did interact with one another, the internal energy would
contain as an addend the potential energy of interaction, and the
latter would depend on the mean distance between the molecules,
i.e. on $V^{1/3}$.

It must be noted that interaction should take place upon colli- 
sions, i.e. when the molecules come very close to one another. Such 
collisions are very few in number in a rarefied gas, however. Each 
molecule spends the predominating part of its time in free flight. 

The heat capacity of a body is defined as the quantity equal to the 
amount of heat that must be imparted to the body to raise its temperature 
by one kelvin. If the amount of heat d'Q imparted to a body raises 
its temperature by dT, then its heat capacity by definition is 

$C_{\text{body}} = \frac{d'Q}{dT}$ (10.24)

This quantity is measured in joules per kelvin (J/K). 

We shall denote the heat capacity of a mole of a substance, called
the molar heat capacity, by the symbol C. It is measured in joules
per mole-kelvin [J/(mol·K)].

The heat capacity of a unit mass of a substance is called the specific
heat capacity. We shall use the symbol c for it. The quantity
c is measured in joules per kilogramme-kelvin [J/(kg·K)].

The following relation obviously holds between the heat capacity 
of a mole of a substance and the specific heat capacity of the same 
substance:

$c = \frac{C}{M}$ (10.25)


(M is the molar mass). 

The value of the heat capacity depends on the conditions in which 
a body is heated. The heat capacity for heating at a constant volume 
or a constant pressure is of the greatest interest. The heat capacities 
at constant volume and constant pressure are designated by $C_V$ and
$C_p$, respectively.

When heating occurs at constant volume, a body does no work 
on external bodies, and, consequently, according to the first law 
of thermodynamics [see Eq. (10.9)], all the heat is spent on the
increment of the internal energy of the body:

$d'Q_V = dU$ (10.26)

It can be seen from Eq. (10.26) that the heat capacity of any body 
at constant volume is 

$C_V = \left(\frac{\partial U}{\partial T}\right)_V$ (10.27)

This notation stresses the fact that when differentiating the expression
for U with respect to T, the volume must be considered constant.
For an ideal gas, U depends only on T, and Eq. (10.27) can be written
in the form

$C_V = \frac{dU}{dT}$


(to obtain the molar heat capacity of a gas, we must take the internal 
energy of a mole). 

Equation (10.23) for one mole of a gas has the form $U_m = B_m T$.
Differentiating it with respect to T, we find that $C_V = B_m$. Thus,
the expression for the internal energy of one mole of an ideal gas
can be written in the form

$U_m = C_V T$ (10.28)

where $C_V$ is a constant quantity, the molar heat capacity of a gas
at constant volume.

The internal energy of an arbitrary mass m of a gas will equal the 
internal energy of one mole multiplied by the number of moles of 
the gas in the mass m: 

$U = \frac{m}{M} C_V T$ (10.29)

If a gas is heated at constant pressure, it will expand, doing 
positive work on external bodies. Consequently, more heat will be 
needed to raise the temperature of the gas by one kelvin in this case 
than when heating it at constant volume, since part of the heat will be
used by the gas to do work. Hence, the heat capacity at constant 
pressure must be greater than that at constant volume. 

Let us write Eq. (10.13) of the first law of thermodynamics for 
a mole of a gas: 

$d'Q_p = dU_m + p\,dV_m$ (10.30)

The subscript p of d'Q in this expression indicates that heat is
imparted to the gas in conditions when p is constant. Dividing
Eq. (10.30) by dT, we get an expression for the molar heat capacity
of a gas at constant pressure:

$C_p = \frac{dU_m}{dT} + p \left(\frac{\partial V_m}{\partial T}\right)_p$ (10.31)

The addend $dU_m/dT$ equals, as we have seen, the molar heat capacity
of a gas at constant volume. Therefore, Eq. (10.31) can be written
as follows:

$C_p = C_V + p \left(\frac{\partial V_m}{\partial T}\right)_p$ (10.32)

The quantity $(\partial V_m/\partial T)_p$ is the increment of the volume of a mole
when the temperature is raised by one kelvin at constant p.
According to the equation of state (10.17), we have
$V_m = RT/p$. Differentiating this expression with respect to T provided
that p = const, we find

$\left(\frac{\partial V_m}{\partial T}\right)_p = \frac{R}{p}$


Finally, using this result in Eq. (10.32), we get 

$C_p = C_V + R$ (10.33)


Thus, the work done by a mole of an ideal gas when its temperature 
is raised by one kelvin at constant pressure equals the molar gas 
constant. It must be noted that Eq. (10.33) has been obtained by 
using an equation of state for an ideal gas and, consequently, holds 
only for an ideal gas. 

The quantity 


$\gamma = \frac{C_p}{C_V}$ (10.34)


is characteristic of every gas. For monatomic gases, its
value is close to 1.67, for diatomic gases to 1.4, for triatomic gases
to 1.33, etc. In the following (see Sec. 11.5), we shall see that the
value of γ is determined by the number and the nature of the degrees
of freedom of the molecule.

Substituting for $C_p$ in Eq. (10.34) its value from Eq. (10.33), we
have

$\gamma = \frac{C_V + R}{C_V} = 1 + \frac{R}{C_V}$

whence

$C_V = \frac{R}{\gamma - 1}$ (10.35)


Using this value of $C_V$ in Eq. (10.29), we get the following expression:

$U = \frac{m}{M} \frac{RT}{\gamma - 1}$ (10.36)


Comparison with Eq. (10.18) gives still another expression for the 
internal energy of an ideal gas: 



$U = \frac{pV}{\gamma - 1}$ (10.37)
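
The relations of this section are easy to check numerically. The following sketch assumes R = 8.31 J/(mol·K) and the approximate values of γ quoted above; the function names are hypothetical. It computes $C_V = R/(\gamma - 1)$ and $C_p = C_V + R$ and then the internal energy in the form $U = pV/(\gamma - 1)$.

```python
R = 8.31  # molar gas constant, J/(mol*K)


def molar_heat_capacities(gamma):
    """Return (C_V, C_p) of an ideal gas from gamma = C_p / C_V."""
    c_v = R / (gamma - 1)  # Eq. (10.35)
    c_p = c_v + R          # Eq. (10.33)
    return c_v, c_p


def internal_energy(p, V, gamma):
    """Internal energy (J) of an ideal gas, U = p V / (gamma - 1)."""
    return p * V / (gamma - 1)


# Approximate values of gamma quoted in the text.
for name, gamma in [("monatomic", 1.67), ("diatomic", 1.40), ("triatomic", 1.33)]:
    c_v, c_p = molar_heat_capacities(gamma)
    print(f"{name}: C_V = {c_v:.1f}, C_p = {c_p:.1f} J/(mol*K)")

# Assumed example: one litre of a diatomic gas at 1.01e5 Pa.
U = internal_energy(1.01e5, 1.0e-3, 1.40)
print(f"U = {U:.1f} J")
```

Note that the last expression needs neither the mass of the gas nor its temperature: the pressure and the volume determine the internal energy.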


10.10. Equation of Adiabat 
of an Ideal Gas 

In the course of a process, a gas, in addition to an equation of 
state, obeys another condition determined by the nature of the 
process. For example, the condition p = const is observed in the 
so-called isobaric process. The condition V = const holds in an 
isochoric process. Finally, in an isothermal process, T = const. 
For an ideal gas, the condition T = const is equivalent to the con- 
dition 

pV = const (10.38) 

Equation (10.38) is called the equation of an isotherm of an ideal 
gas, and the curve determined by this equation is named an isotherm. 

A process going on without heat exchange with the surroundings 
is called adiabatic. Let us find an equation relating the parameters 
of an ideal gas in an adiabatic process. Introducing Eq. (10.29) for 
dU of an ideal gas into Eq. (10.13) of the first law of thermodynamics, 
we obtain 


$d'Q = \frac{m}{M} C_V\,dT + p\,dV$

Since for an adiabatic process we have d'Q = 0, the following condition
must be observed:

$\frac{m}{M} C_V\,dT + p\,dV = 0$ (10.39)

Now let us express p through V and T in accordance with the
equation of state for an ideal gas (10.18):

$p = \frac{m}{M} \frac{RT}{V}$

and introduce this expression into Eq. (10.39). As a result, after
cancelling the nonzero factor m/M, we get

$C_V\,dT + RT \frac{dV}{V} = 0$


Let us transform the above expression as follows: 


$\frac{dT}{T} + \frac{R}{C_V} \frac{dV}{V} = 0$

This expression can be written in the form

$d\left(\ln T + \frac{R}{C_V} \ln V\right) = 0$


whence it follows that in an adiabatic process 

$\ln T + \frac{R}{C_V} \ln V = \text{const}$ (10.40)

In accordance with Eq. (10.35), the ratio $R/C_V$ can be replaced
with $\gamma - 1$, where $\gamma = C_p/C_V$. Making this substitution in Eq. (10.40)
and converting to a power, we get

$TV^{\gamma - 1} = \text{const}$ (10.41)

This equation is an equation of an adiabat of an ideal gas in
the variables T and V. We can pass over from this equation to one in
the variables p and V by replacing T in it with p and V in accordance
with the equation of state for an ideal gas (10.18):

$T = \frac{M}{m} \frac{pV}{R}$

Using this expression in Eq. (10.41) and taking into account that
m, M and R are constants, we get

$pV^{\gamma} = \text{const}$ (10.42)

[The values of the constants in Eqs. (10.40)-(10.42) are obviously
different.]

Expression (10.42) is an equation of an adiabat of an ideal gas
in the variables p and V. It is also called the Poisson equation.
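
As a hedged illustration of the adiabat equations, the sketch below computes the final pressure and temperature after a reversible adiabatic compression of an ideal gas. The function name, the tenfold compression, and the initial state are assumptions chosen only for the example.

```python
def adiabatic_final_state(p1, T1, compression_ratio, gamma):
    """Final pressure and temperature after a reversible adiabatic change of
    volume of an ideal gas; compression_ratio is V1 / V2."""
    p2 = p1 * compression_ratio ** gamma        # from p V^gamma = const
    T2 = T1 * compression_ratio ** (gamma - 1)  # from T V^(gamma - 1) = const
    return p2, T2


# Assumed example: a diatomic gas (gamma = 1.4) at 1.01e5 Pa and 293 K
# compressed adiabatically to one tenth of its initial volume.
p1, T1, ratio = 1.01e5, 293.0, 10.0
p2, T2 = adiabatic_final_state(p1, T1, ratio, gamma=1.4)
print(f"p2 = {p2:.3g} Pa, T2 = {T2:.0f} K")
```

The two results are not independent: the ratio pV/T is the same in the initial and final states, as the equation of state requires.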

It follows from a comparison of the adiabat equation (10.42) 
with the isotherm equation (10.38) that an adiabat is steeper than an 
isotherm. Let us calculate dp/dV for an isotherm and an adiabat at 
the same point with the coordinates p and V (Fig. 10.6). Differentiation
of Eq. (10.38) yields

$p\,dV + V\,dp = 0$

whence for an isotherm we obtain

$\frac{dp}{dV} = -\frac{p}{V}$ (10.43)

Differentiation of Eq. (10.42) yields

$p \gamma V^{\gamma - 1}\,dV + V^{\gamma}\,dp = 0$

whence

$\frac{dp}{dV} = -\gamma \frac{p}{V}$

Thus, the slope of an adiabat is γ times greater than that of an
isotherm.

We assumed in all our reasoning that the state of a gas at each
moment is characterized by definite values of the parameters p and T,
i.e., in other words, that the adiabatic process being considered is
reversible. We know that only a very slow process can be reversible.
At the same time, since no substance is a perfect heat insulator,
the amount of heat exchanged by a system with its surroundings will
be the smaller, the shorter the time taken by a process.
Thus, only fast processes can be close to an adiabatic one. An example
of such a process is the compression and expansion occurring at each
point of a gas in which a sound wave is propagating. Notwithstanding
the fact that within the confines of a large volume the state of the gas
is not an equilibrium one (p and T are different at different points), the
behaviour of the gas within the limits of each sufficiently small volume
is quite satisfactorily described by Eq. (10.42) of an adiabat.



10.11. Polytropic Processes

Processes in which the heat capacity of a body remains constant are 
defined as polytropic ones. Thus, the condition which is observed 
in a polytropic process is 

C = const (10.44)

Let us find the equation of a polytrope for an ideal gas. We shall
write equation (10.13) of the first law for one mole of gas, substituting
C dT for d'Q and $C_V\,dT$ for dU:

$C\,dT = C_V\,dT + p\,dV$ (10.45)

This equation includes all three parameters: p, V and T. One of them
can be excluded with the aid of an equation of state. To obtain an
equation of a polytrope directly in the variables p and V, let us exclude T.
To this end, let us differentiate the equation pV = RT:

$p\,dV + V\,dp = R\,dT$ (10.46)


Excluding dT from Eqs. (10.45) and (10.46) and bringing together 
similar terms, we get 

$(C - C_V - R)\,p\,dV + (C - C_V)\,V\,dp = 0$ (10.47)

Substituting $C_p$ for $C_V + R$ [see Eq. (10.33)] and dividing Eq. (10.47)
by pV, we arrive at the differential equation

$(C - C_p) \frac{dV}{V} + (C - C_V) \frac{dp}{p} = 0$ (10.48)


The quantities C, $C_p$, and $C_V$ are constants. Therefore, integration
of Eq. (10.48) gives the expression

$(C - C_p) \ln V + (C - C_V) \ln p = \text{const}$ (10.49)

Dividing this expression by $C - C_V$ (which is possible if $C \neq C_V$)
and converting to a power, we get

$pV^n = \text{const}$ (10.50)


where 


$n = \frac{C - C_p}{C - C_V}$ (10.51)

It is exactly Eq. (10.50) that is the required equation of a polytrope
of an ideal gas for $C \neq C_V$. The quantity n determined by
Eq. (10.51) is called the polytropic exponent or index.

Let us turn to Eq. (10.49) to establish the nature of a polytropic
process when $C = C_V$. For this condition, the equation acquires
the form $(C_V - C_p) \ln V = \text{const}$, whence it follows that V in the
course of the process remains constant. Hence, a polytropic process
with $C = C_V$ is an isochoric one. This could be foreseen because
$C_V$ = const and is the heat capacity at constant volume, i.e. in
an isochoric process. By Eq. (10.51), the polytropic exponent in
an isochoric process equals infinity.

The other processes treated in the preceding section also belong
to the category of polytropic processes. For an isobaric process, we
have n = 0 [see Eq. (10.50)], for an isothermal one n = 1, and,
finally, for an adiabatic process n = γ. The values of the polytropic
exponent n for these processes are given in Table 10.1.

Solving Eq. (10.51) relative to C, we get an equation for the heat
capacity of an ideal gas in a polytropic process:

$C = \frac{nC_V - C_p}{n - 1}$ (10.52)

Setting n = γ makes expression (10.52) vanish [Eq. (10.35) must be
taken into account in verifying this statement]. Consequently,
the heat capacity of an ideal gas in an adiabatic process equals zero.
The heat capacity of all bodies vanishes in an adiabatic process.
This can be seen from the fact that in an adiabatic process d'Q = 0,
whereas dT differs from zero.

Setting n = 1 makes expression (10.52) infinite.
Thus, in an isothermal process, the heat capacity is infinitely great.
The explanation is that in an isothermal process dT = 0, whereas
d'Q differs from zero.
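
The limiting cases discussed above can be verified with a short calculation. The sketch assumes that Eq. (10.51) solved for C gives $C = (nC_V - C_p)/(n - 1)$, takes R = 8.31 J/(mol·K), and uses a hypothetical function name; the isothermal case n = 1 is returned as infinity by convention.

```python
import math

R = 8.31  # molar gas constant, J/(mol*K)


def polytropic_heat_capacity(n, gamma):
    """Molar heat capacity of an ideal gas in a polytropic process p V^n = const,
    obtained by solving n = (C - C_p) / (C - C_V) for C."""
    c_v = R / (gamma - 1)
    c_p = c_v + R
    if n == 1:
        return math.inf  # isothermal process: dT = 0 while d'Q differs from zero
    return (n * c_v - c_p) / (n - 1)


gamma = 1.4  # assumed value for a diatomic gas
for name, n in [("isobaric", 0), ("isothermal", 1), ("adiabatic", gamma), ("nearly isochoric", 1e9)]:
    print(f"{name}: C = {polytropic_heat_capacity(n, gamma):.3f} J/(mol*K)")
```

The isobaric case reproduces $C_p$, the adiabatic case gives zero to rounding accuracy, and a very large exponent approaches $C_V$, in agreement with the statement that n tends to infinity for an isochoric process.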


Table 10.1

Process       n
Isobaric      0
Isothermal    1
Adiabatic     γ
Isochoric     ∞


10.12. Work of an Ideal Gas 
in Different Processes 


The work done by a body on external bodies when it passes from 
state 1 to state 2 is 


$A_{12} = \int_{V_1}^{V_2} p\,dV$ (10.53)


[see Eq. (10.12)]. To perform integration, we must express p through 
V. For this purpose, we shall use the relation between p and V in 
different processes. 

Equation (10.50) of a polytrope of an ideal gas can be written as 
follows: 

$pV^n = p_1 V_1^n = p_2 V_2^n$


where $p_1$, $V_1$ and $p_2$, $V_2$ are the values of the pressure and volume
of the gas in the first (initial) and second (final) states, respectively,
and p and V are the pressure and volume in any intermediate state.
The above equation allows us to express the pressure of a gas through
its volume and the values of the parameters in the initial or final
state. Taking the former, we have

$p = \frac{p_1 V_1^n}{V^n}$



Introduction of this equation into Eq. (10.53) yields 

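$A_{12} = p_1 V_1^n \int_{V_1}^{V_2} \frac{dV}{V^n}$ (10.54)

Let us first consider the case when $n \neq 1$; the integral in Eq. (10.54)
for it is

$\int_{V_1}^{V_2} \frac{dV}{V^n} = \frac{1}{n - 1} \left(\frac{1}{V_1^{n-1}} - \frac{1}{V_2^{n-1}}\right)$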


Using this value of the integral in Eq. (10.54) and performing simple 
transformations, we get 


$A_{12} = \frac{p_1 V_1}{n - 1} \left[1 - \left(\frac{V_1}{V_2}\right)^{n-1}\right]$ (10.55)


This equation can be transformed by taking advantage of the fact 
that no matter what process occurs with an ideal gas, its para- 
meters are related by an equation of state. In particular, this also 
holds for the initial state: 


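$p_1 V_1 = \frac{m}{M} RT_1$ (10.56)

Taking Eq. (10.56) into account, we can write Eq. (10.55) in the
form

$A_{12} = \frac{m}{M} \frac{RT_1}{n - 1} \left[1 - \left(\frac{V_1}{V_2}\right)^{n-1}\right]$ (10.57)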

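Equations (10.55) and (10.57) give the work done by an ideal
gas in any polytropic process except for an isothermal one (which
corresponds to n = 1; in this case, Eqs. (10.55) and (10.57) become
indeterminate). In particular, for an adiabatic process

$A_{12} = \frac{p_1 V_1}{\gamma - 1} \left[1 - \left(\frac{V_1}{V_2}\right)^{\gamma - 1}\right]$ (10.58)

or

$A_{12} = \frac{m}{M} \frac{RT_1}{\gamma - 1} \left[1 - \left(\frac{V_1}{V_2}\right)^{\gamma - 1}\right]$ (10.59)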

To calculate the work of an ideal gas in an isothermal process, 
let us express the pressure in Eq. (10.53) through other quantities 
in accordance with an equation of state. The result is (we can put T 
outside the integral since it is constant):

$A_{12} = \frac{m}{M} RT \int_{V_1}^{V_2} \frac{dV}{V} = \frac{m}{M} RT \ln \frac{V_2}{V_1}$

Thus, the work done by an ideal gas in an isothermal process is

$A_{12} = \frac{m}{M} RT \ln \frac{V_2}{V_1}$ (10.60)

In an isobaric process, the work done by any body including an 
ideal gas, as can be seen from Eq. (10.53), is 

$A_{12} = p(V_2 - V_1)$ (10.61)

The same result is obtained if we assume that n = 0 in Eq. (10.55). 
We shall note in concluding that the work equals zero in an isochoric 
process. This holds for any bodies. 
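
The closed-form expressions for the work can be compared with a direct numerical evaluation of the integral of p dV. The sketch below is an illustration only: the function names, the initial state, and the number of integration steps are assumptions, and the isothermal case uses $(m/M)RT = p_1 V_1$.

```python
import math


def polytropic_work(p1, V1, V2, n):
    """Work done by an ideal gas in a polytropic process p V^n = const."""
    if n == 1:
        # Isothermal case, Eq. (10.60), written with (m/M) R T = p1 V1
        return p1 * V1 * math.log(V2 / V1)
    # General case, Eq. (10.55)
    return p1 * V1 / (n - 1) * (1 - (V1 / V2) ** (n - 1))


def numeric_work(p1, V1, V2, n, steps=10_000):
    """Midpoint-rule estimate of the integral of p dV, Eq. (10.53),
    with p = p1 * (V1 / V)^n along the polytrope."""
    dV = (V2 - V1) / steps
    total = 0.0
    for i in range(steps):
        V = V1 + (i + 0.5) * dV
        total += p1 * (V1 / V) ** n * dV
    return total


# Assumed example: expansion from 1 litre to 2 litres starting at 1.0e5 Pa.
p1, V1, V2 = 1.0e5, 1.0e-3, 2.0e-3
for name, n in [("isobaric", 0), ("isothermal", 1), ("adiabatic", 1.4)]:
    exact = polytropic_work(p1, V1, V2, n)
    approx = numeric_work(p1, V1, V2, n)
    print(f"{name}: formula {exact:.3f} J, numerical {approx:.3f} J")
```

For the same initial state and the same expansion, the work decreases as n grows, because the pressure then falls off more steeply with the volume.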


10.13. Van der Waals Gas 

We mentioned in Sec. 10.8 that the behaviour of real gases is well 
described by Eq. (10.17), i.e. 

pV m = RT 

only at low densities, i.e. at not too high pressures and sufficiently 
high temperatures [see Eq. (10.22)]. Considerable deviations from 
this equation are observed with an increase in the pressure and a 
decrease in the temperature. The second column of Table 10.2 gives 
the values of the product pV for the mass of nitrogen occupying a 
volume of one litre in standard conditions. These values are given 
for different pressures and the same temperature 0°C. 

Table 10.2

p, atm    pV, atm·l    $(p + a'/V^2)(V - b')$, atm·l
1         1.000        1.000
100       0.994        1.000
200       1.048        1.009
500       1.390        1.014
1000      2.069        0.893


According to Eq. (10.17), the product pV must remain constant 
when the temperature does not change. Actually, as can be seen 
from the table, appreciable deviations are observed at a pressure
of about 200 atm. They grow continuously with increasing pressure 
and reach over 100 per cent at 1000 atm. These deviations are not 
surprising because when the density grows, the volume of the mole- 
cules and the interaction between them begin to play a greater and 
greater part. 

A great variety of equations were proposed to describe the behav- 
iour of gases within a broad density range. The one proposed by 
J. van der Waals is the simplest of them, while giving sufficiently 
good results. This equation was obtained by introducing corrections 
into Eq. (10.17) and has the following form: 

$\left(p + \frac{a}{V_m^2}\right)(V_m - b) = RT$ (10.62)

where p is the pressure exerted on the gas from outside (equal to the
pressure of the gas on the walls of the vessel it occupies), and a and b
are the van der Waals constants. Their values differ for different gases
and are determined experimentally. If the pressure is measured in
pascals and the volume in cubic metres per mole, then the constant
a is in $\text{Pa}\cdot\text{m}^6/\text{mol}^2$, and the constant b is in $\text{m}^3/\text{mol}$. Sometimes the
constants a and b are expressed in $\text{atm}\cdot\text{l}^2/\text{mol}^2$ and l/mol, respectively.

Owing to the mutual attraction between its molecules, a gas, as
it were, is compressed by a greater pressure than the pressure p
exerted on it by the walls of the vessel confining it. The correction
$a/V_m^2$ characterizes the addition to the external pressure due to the
mutual attraction of the molecules. Molecules have an appreciable
action on one another only within small distances of the order of
the so-called radius of molecular action. The force of mutual attraction of
two elementary volumes having dimensions of the order of this
radius is proportional both to the number of molecules contained
in one of the volumes and to that in the other volume. Each of these
numbers, in turn, is proportional to the number of molecules in
unit volume, i.e. is inversely proportional to the volume of the gas.
The force is therefore inversely proportional to the square of the volume,
which explains why the correction to the pressure in Eq. (10.62)
has the form $a/V_m^2$.

Since the molecules have a finite volume, the space available for 
motion of the molecules is less than the volume of the vessel $V_m$.
The correction b in Eq. (10.62) characterizes the part of the volume 
that is not available for motion of the molecules. In its order of 
magnitude, it equals several total volumes of the molecules con- 
tained in a mole of a gas. 

Equation (10.62) has been written for one mole of a gas. To go
over to an equation for an arbitrary mass m, we must take into
account that ν moles of a gas in the same conditions occupy a volume
that is ν times greater: $V = \nu V_m$. Substituting $V/\nu$ for $V_m$ in
Eq. (10.62), we get

$\left(p + \frac{\nu^2 a}{V^2}\right)\left(\frac{V}{\nu} - b\right) = RT$

Multiplying this equation by ν and introducing the symbols

$a' = \nu^2 a, \quad b' = \nu b$ (10.63)

we arrive at the van der Waals equation for ν moles:

$\left(p + \frac{a'}{V^2}\right)(V - b') = \nu RT$ (10.64)

The symbols a' and b' designate the van der Waals constants for
ν moles. Equations (10.63) show how they are related to a and b.
The constant a' is measured in $\text{Pa}\cdot\text{m}^6$, and b' in $\text{m}^3$.

How much better the van der Waals equation shows the behaviour 
of gases than Eq. (10.17) can be seen from the data contained in 
Table 10.2. The third column of the table gives the values of the 
quantity $(p + a'/V^2)(V - b')$, which ought to be constant according
to Eq. (10.64), for the same mass of nitrogen for which the values of 
pV are given in the second column. Inspection of the table shows that 
the van der Waals equation agrees much better with experimental 
data than Eq. (10.17). 

Since all real gases approach ideal gases in their properties when 
their density diminishes, the van der Waals equation in the limit, 
when the volume tends to infinity, transforms into Eq. (10.17). We 
can convince ourselves that this is true by factoring out p and V 
in Eq. (10.64):

$pV \left(1 + \frac{a'}{pV^2}\right) \left(1 - \frac{b'}{V}\right) = \nu RT$

and taking into consideration that the product pV is approximately
constant. 

Real gases obey the van der Waals equation only approximately. 
An imaginary gas that obeys Eq. (10.62) exactly is called a van der 
Waals gas. 
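
To see how the corrections behave, Eq. (10.62) can be solved for the pressure and compared with Eq. (10.17). The sketch below is not a reproduction of Table 10.2. The constants for nitrogen (a ≈ 0.137 Pa·m⁶/mol², b ≈ 3.87 × 10⁻⁵ m³/mol) are approximate handbook-style values assumed for illustration, and the function names are hypothetical.

```python
R = 8.31  # molar gas constant, J/(mol*K)

# Approximate van der Waals constants for nitrogen (assumed, not from the text)
A_N2 = 0.137    # Pa*m^6/mol^2
B_N2 = 3.87e-5  # m^3/mol


def vdw_pressure(V_m, T, a, b):
    """Pressure (Pa) from Eq. (10.62) solved for p: p = R T / (V_m - b) - a / V_m^2."""
    return R * T / (V_m - b) - a / V_m ** 2


def ideal_pressure(V_m, T):
    """Pressure (Pa) of one mole of an ideal gas, Eq. (10.17)."""
    return R * T / V_m


T = 273.0
for V_m in (22.4e-3, 1.0e-3, 2.0e-4, 1.0e-4):  # molar volumes in m^3/mol
    p_vdw = vdw_pressure(V_m, T, A_N2, B_N2)
    p_id = ideal_pressure(V_m, T)
    print(f"V_m = {V_m:.1e} m^3/mol: van der Waals {p_vdw:.3e} Pa, ideal {p_id:.3e} Pa")
```

At the molar volume corresponding to standard conditions the two pressures differ only slightly, while at small molar volumes the difference becomes considerable.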

The internal energy of a van der Waals gas must include, in 
addition to the kinetic energy of the molecules, the energy of interac- 
tion between them. To find the internal energy of a van der Waals 
gas, let us take advantage of the circumstance that the work done 
in the expansion of a gas against the forces of mutual attraction of 
the molecules to one another equals the increment of the interaction 
energy: $d'A = dE_p$. The forces of mutual attraction between the
molecules are taken into account in Eq. (10.62) with the aid of the
addition $a/V_m^2$ to the pressure. Accordingly, the work against the
forces of interaction between the molecules can be represented in
the form $(a/V_m^2)\,dV_m$ (similarly, the work done by a gas against the
external forces is determined by the expression p dV). Thus,

$dE_p = \frac{a}{V_m^2}\,dV_m$

Integration of this expression shows that

$E_p = -\frac{a}{V_m} + \text{const}$ (10.65)

The internal energy of a van der Waals gas depends on both the 
volume and the temperature. Hence, the expression for U has the 
form 

$U = f(T) - \frac{a}{V_m}$

[we have included the constant of Eq. (10.65) in f(T)]. This expression
in the limit, when the volume tends to infinity, must transform
into Eq. (10.28) for the internal energy of an ideal gas. Therefore,
$f(T) = C_V T$.

Thus, the internal energy of a mole of a van der Waals gas is 
determined by the equation 

$U_m = C_V T - \frac{a}{V_m}$ (10.66)

The internal energy of ν moles will be ν times greater:

$U = \nu C_V T - \frac{a'}{V}$ (10.67)

(we have taken into consideration that $\nu^2 a = a'$ and $\nu V_m = V$).
By Eqs. (10.66) and (10.67), we can find the approximate values of 
the internal energy of real gases. 

10.14. The Barometric Formula 

The atmospheric pressure at the altitude h is due to the weight of 
the layers of gas above this altitude. Let p be the pressure at the 
altitude h. Hence, the pressure at the altitude h + dh will be p + dp.
If dh is greater than zero, then dp will be less than zero because the
weight of the higher layers of the atmosphere and, therefore, the
pressure, diminish with the altitude. The difference between the
pressures p and p + dp equals the weight of the gas contained in
the volume of a cylinder with a base area of unity and an altitude
of dh (Fig. 10.7):

$p - (p + dp) = \rho g\,dh$

where ρ is the density of the gas at the altitude h. Hence,

$dp = -\rho g\,dh$ (10.68)
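
Equation (10.68) can be integrated step by step once the density is expressed through the pressure. The sketch below does this with $\rho = Mp/(RT)$ from Eq. (10.22) under assumptions that are not part of the derivation so far: the temperature is taken to be the same at all altitudes, the molar mass of air is taken as 0.029 kg/mol, and g = 9.81 m/s² is treated as constant. The function name is hypothetical.

```python
R = 8.31       # molar gas constant, J/(mol*K)
G = 9.81       # acceleration of free fall, m/s^2 (assumed constant with altitude)
M_AIR = 0.029  # molar mass of air, kg/mol (assumed value)


def pressure_at_altitude(h, p0=1.01e5, T=273.0, steps=10_000):
    """Integrate dp = -rho g dh upward from the pressure p0 at h = 0,
    with rho = M p / (R T) and a constant temperature T (an assumption)."""
    dh = h / steps
    p = p0
    for _ in range(steps):
        rho = M_AIR * p / (R * T)  # density of an ideal gas, Eq. (10.22)
        p -= rho * G * dh          # Eq. (10.68)
    return p


for h in (0.0, 1000.0, 5000.0):
    print(f"h = {h:6.0f} m: p = {pressure_at_altitude(h):.4g} Pa")
```

Under these assumptions the pressure falls by the same fraction for every equal increase in altitude.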

We indicated in Sec. 10.8 that air differs only slightly from an 
ideal gas in its behaviour in conditions close to standard ones. There- 
fore, the density of air can be calculated by Eq. (10.22). The intro- 





CHAPTER 11 STATISTICAL PHYSICS 


11.1. Information from the Theory 
of Probability 

Assume that we have a macroscopic system, i.e. a system formed 
by an enormous number of microparticles (molecules, atoms, ions, 
electrons), in a given state. Assume further that a quantity x charac- 
teristic of the system can have the discrete values 

$x_1, x_2, \ldots, x_i, \ldots, x_k, \ldots$

Let us make a very great number N of measurements of the quan- 
tity x, bringing the system before each measurement to the same 
initial state. Instead of performing repeated measurements of the 
same system, we can take N identical systems in the same state 
and measure the quantity x once in all these systems. Such a set of 
identical systems in an identical state is called a statistical ensemble. 

Assume that $N_1$ measurements gave the result $x_1$, $N_2$ measurements
the result $x_2$, ..., $N_i$ measurements the result $x_i$, and so on
($\sum N_i = N$ is the number of systems in the ensemble). The quantity $N_i/N$
is defined as the relative frequency of appearance of the result $x_i$,
while the limit of this quantity obtained when N tends to infinity,
i.e.

$P_i = \lim_{N \to \infty} \frac{N_i}{N}$ (11.1)

is called the probability of appearance of the result $x_i$. In the following,
in order to simplify the equations, we shall write the expression
for the probability in the form $N_i/N$, bearing in mind that the
transition to the limit is performed at $N \to \infty$.

Since $\sum N_i = N$, we have

$\sum P_i = \sum \frac{N_i}{N} = 1$ (11.2)

i.e. the sum of the probabilities of all possible results of measurement 
equals unity. 

The probability of obtaining the result $x_i$ or $x_k$ is

$P_{i\ \text{or}\ k} = \frac{N_i + N_k}{N} = \frac{N_i}{N} + \frac{N_k}{N} = P_i + P_k$

We have thus arrived at the theorem of summation of probabilities.
It states that

$P_{i\ \text{or}\ k} = P_i + P_k$ (11.3)

Assume that a system is characterized by the values of two quantities
x and y. Both quantities can take on discrete values whose
probabilities of appearance are

$P(x_i) = \frac{N(x_i)}{N}, \quad P(y_k) = \frac{N(y_k)}{N}$

Let us find the probability $P(x_i, y_k)$ of the fact that a certain measurement
will give the result $x_i$ for x and $y_k$ for y. The result $x_i$ is
obtained in a number of measurements equal to $N(x_i) = P(x_i) N$.
If the value of the quantity y does not depend on that of x, then the
result $y_k$ will be obtained simultaneously with $x_i$ in a number of
cases equal to

$N(x_i, y_k) = N(x_i) P(y_k) = [P(x_i) N] P(y_k)$

[$N(x_i)$ plays the part of N for y]. The required probability is

$P(x_i, y_k) = \frac{N(x_i, y_k)}{N} = P(x_i) P(y_k)$

Now we have arrived at the theorem of multiplication of probabilities
according to which the probability of the simultaneous occurrence of
statistically independent events equals the product of the probabilities
of each of them occurring separately:

$P(x_i, y_k) = P(x_i) P(y_k)$ (11.4)

Knowing the probability of the appearance of different measurement
results, we can find the mean value of all the results. According
to the definition of the mean value

$$\langle x \rangle = \frac{\sum N_i x_i}{N} = \sum P_i x_i \tag{11.5}$$

Let us extend the results obtained to the case when the quantity x 
characterizing a system can take on a continuous series of values 
from zero to infinity. In this case, the quantity x is said to have a 
continuous spectrum of values (in the previous case the spectrum of
values was discrete).

Let us take a very small quantity $a$ (say, $a = 10^{-6}$) and find the
number of measurements $\Delta N_0$ which give $0 < x < a$, the number
$\Delta N_1$ which give $a < x < 2a$, ..., the number $\Delta N_x$ for which the
result of the measurements is within the interval from $x$ to $x + a$,
and so on. The probability that the result of the measurements
will be within the interval from zero to $a$ is $\Delta P_0 = \Delta N_0/N$,
within the interval from $a$ to $2a$ is $\Delta P_1 = \Delta N_1/N$, ..., within the
interval from $x$ to $x + a$ is $\Delta P_x = \Delta N_x/N$. Let us draw an $x$-axis
and lay off strips of width $a$ and of height $\Delta P_x/a$ upward from it
(Fig. 11.1a). We obtain a bar graph or histogram. The area of the
bar whose left-hand edge has the coordinate $x$ is $\Delta P_x$, and the area
of the entire histogram is unity [see Eq. (11.2)].

A histogram characterizes graphically the probability of obtaining 
results of measurements confined within different intervals of width a. 
The smaller the width of the interval a, the more detailed will the 
distribution of the probabilities of obtaining definite values of x be 
characterized. In the limit when $a \to 0$, the stepped line confining
the histogram transforms into a smooth curve (Fig. 11.1b). The
function $f(x)$ defining this curve analytically is called a probability
distribution function.

Fig. 11.1 (a, b)
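
The construction of the histogram can be sketched in a few lines of code. The example below is only an illustration under stated assumptions: the measured quantity is taken to follow a triangular distribution on the interval from 0 to 2 (a hypothetical choice), and the function name and the bar width are likewise ours. It lays off bars of height $\Delta P_x/a$ and checks that the total area is unity.

```python
import random


def histogram_heights(samples, a, x_max):
    """Bar heights dP_x / a for the intervals [k*a, (k+1)*a) covering 0..x_max."""
    n_bins = int(round(x_max / a))
    counts = [0] * n_bins
    for x in samples:
        k = int(x / a)
        if 0 <= k < n_bins:
            counts[k] += 1
    n = len(samples)
    return [c / n / a for c in counts]


rng = random.Random(1)
# Hypothetical continuous quantity: triangular distribution on [0, 2) peaked at 1.
samples = [rng.triangular(0.0, 2.0, 1.0) for _ in range(100_000)]

a = 0.1                                    # width of one bar
heights = histogram_heights(samples, a, x_max=2.0)
area = sum(h * a for h in heights)         # sum of the bar areas dP_x

print(area)
print(heights[0], heights[9])              # a low bar near x = 0, a high bar near x = 1
```

Reducing a (and increasing the number of samples accordingly) makes the stepped outline approach the smooth curve described in the text.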

In accordance with the procedure followed in plotting the distribution
curve, the area of the bar of width $dx$ (see Fig. 11.1b) equals
the probability that the result of a measurement will be
within the range from $x$ to $x + dx$. Denoting this probability by
$dP_x$, we can write that

$$dP_x = f(x)\,dx \tag{11.6}$$

The subscript “x” used with $dP$ indicates that we have in mind the
probability for the interval whose left-hand edge is at the point
with the coordinate $x$. The area confined by a distribution curve,
like that of a histogram, equals unity. This signifies that

$$\int f(x)\,dx = \int dP_x = 1 \tag{11.7}$$

Integration is performed over the entire interval of possible values
of the quantity $x$. Equation (11.7) is an analogue of Eq. (11.2).

Knowing the distribution function $f(x)$, we can find the mean
value of the results of measuring the quantity $x$. In $dN_x = N\,dP_x$
cases, a result equal to $x$ is obtained. The sum of such results is
determined by the expression $x\,dN_x = xN\,dP_x$. The sum of all
the possible results is $\int x\,dN_x = \int xN\,dP_x$. Dividing this sum by
the number of measurements $N$, we get the mean value of the quantity $x$:

$$\langle x \rangle = \int x\,dP_x \tag{11.8}$$

This equation is an analogue of Eq. (11.5).

Using Eq. (11.6) for $dP_x$ in Eq. (11.8), we obtain

$$\langle x \rangle = \int x f(x)\,dx \tag{11.9}$$

Similar reasoning shows that the mean value of a function $\varphi(x)$
can be calculated by the equation

$$\langle \varphi(x) \rangle = \int \varphi(x) f(x)\,dx \tag{11.10}$$

For example,

$$\langle x^2 \rangle = \int x^2 f(x)\,dx \tag{11.11}$$
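
Equations (11.7), (11.9), and (11.11) lend themselves to a direct numerical check. The sketch below assumes, purely for illustration, the distribution function $f(x) = x e^{-x}$ on the interval from zero to infinity; this function, the cut-off `X_MAX`, and the helper `integrate` are our own choices and do not come from the text. For this particular $f(x)$ the exact values are 1, 2, and 6, so that the mean square is seen to differ from the square of the mean.

```python
import math


def integrate(g, lo, hi, steps=200_000):
    """Midpoint-rule estimate of the integral of g from lo to hi."""
    h = (hi - lo) / steps
    return h * sum(g(lo + (k + 0.5) * h) for k in range(steps))


def f(x):
    # Hypothetical distribution function on 0 <= x < infinity (illustration only).
    return x * math.exp(-x)


X_MAX = 60.0   # upper cut-off; f(x) is negligible beyond it

norm = integrate(f, 0.0, X_MAX)                           # Eq. (11.7)
mean_x = integrate(lambda x: x * f(x), 0.0, X_MAX)        # Eq. (11.9)
mean_x2 = integrate(lambda x: x * x * f(x), 0.0, X_MAX)   # Eq. (11.11)

print(norm, mean_x, mean_x2)
print(mean_x2, mean_x ** 2)   # mean square versus square of the mean
```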

11.2. Nature of the Thermal Motion 
of Molecules 

If a gas is in equilibrium, its molecules move absolutely without 
order, chaotically. All the directions of motion are equally probable, 
and none of them can be given preference over others. The velocities 
of the molecules may have the most diverse values. Upon each 
collision with other molecules, the magnitude of the velocity or 
speed of a given molecule should, generally speaking, change. It 
may grow or diminish with equal probability. 

The velocities of molecules change by chance upon collisions. 
A molecule in a series of consecutive collisions may receive energy 
from its collision partners, and as a result its energy will considerably
exceed the mean value $\langle \varepsilon \rangle$. Even if we imagine the absolutely
fantastic case, however, in which all the molecules of a gas give up
their energy to a single molecule and stop moving, the energy of
this molecule, and consequently its velocity too, will still be finite.
Thus, the velocity of molecules of a gas cannot have values beginning
with a certain $v_{\max}$ and ending with infinity. Taking into consideration
that processes which would lead to the concentration of a
considerable portion of the total energy of all the molecules on one
molecule have a low probability, we can say that very high velocities
in comparison with the mean value of the velocity can be realized
extremely rarely. In exactly the same way, it is virtually impossible
for the velocity of a molecule to vanish completely as a result of
collisions. Hence, very low and very high velocities in comparison
with the mean value have a low probability. The probability of the
given value of $v$ tends to zero both when $v$ tends to zero and when
it tends to infinity. It thus follows that the velocities of molecules
are mainly grouped near a certain most probable value.

The chaotic nature of motion of molecules can be illustrated with
the aid of the following procedure. Let us surround point O with
a sphere of arbitrary radius $r$ (Fig. 11.2). Any point A on this sphere
determines the direction from O to A. Consequently, the direction
in which the molecules of a gas move at a certain moment can be
set by points on the sphere. The equal probability of all the directions
results in the fact that the points showing the directions of
motion of the molecules will be distributed over the sphere with a
constant density. The latter equals the number $N$ of molecules being
considered divided by the surface area of the sphere $4\pi r^2$. Collisions
lead to changes in the directions of motion of the molecules. As a result,
the positions of the $N$ points on the sphere continuously change.
Owing to the chaotic nature of the motion of the molecules, however,
the density of the points at any spot on the sphere remains constant
all the time.

The number of possible directions in space is infinitely great. But at
each moment a finite number of directions is realized, equal to the
number of molecules being considered. Therefore, the question of how
many molecules have a given direction of motion (depicted by a point
on the sphere) is deprived of all meaning. Indeed, since the number of
possible directions is infinitely great, whereas the number of molecules
is finite, the probability of at least one molecule flying in a strictly
definite direction equals zero.
A question we are able to answer is what number of molecules move 
in directions close to the given one (determined by point A on the 
sphere). All the points of the surface element $\Delta S$ of the sphere taken
in the vicinity of point A (see Fig. 11.2) correspond to these directions.
Since the points depicting the directions of motion of the
molecules are distributed uniformly over the sphere, the number
of points within the area $\Delta S$ will be

$$\Delta N_A = N \frac{\Delta S}{4\pi r^2} \tag{11.12}$$

The subscript A indicates that we have in view the molecules whose
directions of motion are close to that determined by point A.

The ratio $\Delta S/r^2$ is the solid angle $\Delta\Omega$ subtended by the area $\Delta S$.
Therefore, Eq. (11.12) can be written as follows:

$$\Delta N_A = N \frac{\Delta\Omega}{4\pi} \tag{11.13}$$

Fig. 11.2

Here $\Delta\Omega$ is the solid angle containing the directions of motion of
the molecules being considered. We remind our reader that $4\pi$
is a complete solid angle (corresponding to the entire surface of the
sphere).

The direction of OA can be given with the aid of the polar angle $\theta$
and the azimuth $\varphi$ (Fig. 11.3). Hence, the directions of motion of
the molecules of a gas can be characterized by giving for each molecule
the values of the angles $\theta$ and $\varphi$ measured from a fixed direction
OZ (we can take the direction of a normal to the surface of the vessel
confining a gas as such a direction) and the plane $P_0$ drawn through it.

Let us surround the origin of coordinates O with a sphere of radius
$r$ and find the element $dS$ of the sphere corresponding to the increments
$d\theta$ and $d\varphi$ of the angles $\theta$ and $\varphi$ (Fig. 11.4). The element
being considered is a rectangle with the sides $r\,d\theta$ and $r \sin\theta\,d\varphi$.
Thus,

$$dS = r^2 \sin\theta\,d\theta\,d\varphi \tag{11.14}$$

The expression obtained gives an element of the surface $r = \text{const}$
in a spherical system of coordinates.

Dividing Eq. (11.14) by $r^2$, we shall find the element of the solid
angle corresponding to the angle intervals from $\theta$ to $\theta + d\theta$ and
from $\varphi$ to $\varphi + d\varphi$:

$$d\Omega_{\theta,\varphi} = \sin\theta\,d\theta\,d\varphi \tag{11.15}$$

Two spheres of radii $r$ and $r + dr$, two cones with the apex angles
$\theta$ and $\theta + d\theta$, and two planes forming the angles $\varphi$ and $\varphi + d\varphi$
with $P_0$ separate in space a rectangular parallelepiped with the sides
$r\,d\theta$, $r \sin\theta\,d\varphi$, and $dr$ (see Fig. 11.4). The volume of this parallelepiped

$$dV = r^2 \sin\theta\,dr\,d\theta\,d\varphi \tag{11.16}$$

is an element of volume in a spherical system of coordinates (the
volume corresponding to an increase in the coordinates $r$, $\theta$, and $\varphi$
by $dr$, $d\theta$, and $d\varphi$).

Passing over from deltas to differentials in Eq. (11.13) and introducing
Eq. (11.15) for $d\Omega$, we arrive at the expression

$$dN_{\theta,\varphi} = N \frac{\sin\theta\,d\theta\,d\varphi}{4\pi} \tag{11.17}$$

The subscripts $\theta$ and $\varphi$ of $dN$ indicate that we have in view the
molecules whose directions of motion correspond to the angle intervals
from $\theta$ to $\theta + d\theta$ and from $\varphi$ to $\varphi + d\varphi$.
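
Equation (11.17) can be illustrated by a simple random experiment. In the sketch below (ours, not the author's) a large number of directions are drawn isotropically, and the fraction of them whose polar angle falls within a chosen band is compared with the integral of $\sin\theta\,d\theta\,d\varphi/4\pi$ over that band, which equals $(\cos\theta_1 - \cos\theta_2)/2$. The band limits, the sample size, and the way of generating isotropic directions (normalizing three independent Gaussian components) are assumptions made for the illustration.

```python
import math
import random


def random_direction(rng):
    """Isotropic unit vector obtained by normalizing three Gaussian components."""
    while True:
        x, y, z = rng.gauss(0, 1), rng.gauss(0, 1), rng.gauss(0, 1)
        r = math.sqrt(x * x + y * y + z * z)
        if r > 0.0:
            return x / r, y / r, z / r


def fraction_in_band(theta1, theta2):
    """Integral of sin(theta) dtheta dphi / (4 pi) over theta1..theta2 and all phi."""
    return (math.cos(theta1) - math.cos(theta2)) / 2.0


rng = random.Random(2)
N = 100_000
theta1, theta2 = math.radians(30), math.radians(60)   # arbitrary band of polar angles

count = 0
for _ in range(N):
    _, _, z = random_direction(rng)
    theta = math.acos(max(-1.0, min(1.0, z)))   # polar angle measured from the OZ axis
    if theta1 <= theta < theta2:
        count += 1

print(count / N, fraction_in_band(theta1, theta2))
print(fraction_in_band(0.0, math.pi))   # the complete solid angle gives all N molecules
```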


11.3. Number of Collisions of Molecules 
with a Wall 

Let us consider a gas in equilibrium confined in a vessel. We shall
take an element $\Delta S$ of the surface of the vessel and count the number
of collisions of molecules with this element during the time $\Delta t$.

Let us separate from the $N$ molecules in the vessel those $dN_v$ molecules
whose velocities have magnitudes ranging from $v$ to $v + dv$. Of these
molecules, the number of molecules whose directions of motion are
confined within the solid angle $d\Omega$ equals

$$dN_{v,\theta,\varphi} = dN_v \frac{d\Omega_{\theta,\varphi}}{4\pi} \tag{11.18}$$

[see Eq. (11.17)]. Of the molecules separated in this way, the ones
confined in an oblique cylinder with the base $\Delta S$ and the altitude
$(v \cos\theta)\,\Delta t$ (Fig. 11.5) will fly during the time $\Delta t$ up to the
area $\Delta S$ and collide with it*. The number of these molecules is

$$d\nu_{v,\theta,\varphi} = dN_{v,\theta,\varphi} \frac{\Delta S\,(v \cos\theta)\,\Delta t}{V} \tag{11.19}$$

($V$ is the volume of the vessel).


* The objection could seem possible that part of these molecules would collide
with other molecules on their way to the wall, as a result of which they
will change their direction and will not reach the area $\Delta S$. These collisions,
however, will not violate the chaotic nature of motion of the molecules: the
transition of a certain number of molecules from the group moving toward the wall
to groups moving in other directions is attended by the simultaneous transition
of the same number of molecules from the other groups to the one moving toward the
wall. Consequently, in calculating the number of molecules reaching the wall,
the collisions of the molecules with one another may be disregarded.

To obtain the total number of collisions of the molecules with the
area $\Delta S$, we must summate Eq. (11.19) over the solid angle $2\pi$
(corresponding to changes in $\theta$ from 0 to $\pi/2$ and changes in $\varphi$ from 0
to $2\pi$) and over the velocities ranging from 0 to $v_{\max}$, where $v_{\max}$
is the maximum velocity the molecules can have in the given conditions
(see the preceding section).

We shall begin with summation over the directions. For this
purpose, we shall write $d\Omega$ in the form $\sin\theta\,d\theta\,d\varphi$ [see Eq. (11.15)]
and integrate Eq. (11.19) with respect to $\theta$ within the limits from 0
to $\pi/2$ and with respect to $\varphi$ within the limits from 0 to $2\pi$:

$$d\nu_v = dN_v \frac{v\,\Delta S\,\Delta t}{4\pi V} \int_0^{\pi/2} \cos\theta \sin\theta\,d\theta \int_0^{2\pi} d\varphi$$

Integration with respect to $d\varphi$ gives $2\pi$, and the integral with respect
to $d\theta$ equals 1/2. Hence,

$$d\nu_v = \frac{dN_v\,v\,\Delta S\,\Delta t}{4V} \tag{11.20}$$

This expression gives the number of times the molecules flying in
the directions confined within the solid angle $2\pi$ and having velocities
from $v$ to $v + dv$ collide with the area $\Delta S$ during the time $\Delta t$.

Summation over the velocities gives the total number of collisions
of the molecules with the area $\Delta S$ during the time $\Delta t$:

$$\Delta\nu = \frac{\Delta S\,\Delta t}{4V} \int_0^{v_{\max}} v\,dN_v \tag{11.21}$$

The expression

$$\frac{1}{N} \int_0^{v_{\max}} v\,dN_v$$

is the mean value of the speed $v$. Substituting the product $N \langle v \rangle$
for the integral in Eq. (11.21), we find that

$$\Delta\nu = \frac{\Delta S\,\Delta t}{4V} N \langle v \rangle = \frac{1}{4}\,\Delta S\,\Delta t\,n \langle v \rangle \tag{11.22}$$

Here $n = N/V$ is the number of molecules of a gas in unit volume.
Finally, dividing Eq. (11.22) by $\Delta S$ and $\Delta t$, we shall find the
number of collisions of the gas molecules with a unit surface area
of the wall in unit time:

$$\nu = \frac{1}{4} n \langle v \rangle \tag{11.23}$$


The result obtained signifies that the number of collisions is pro- 
portional to the number of molecules per unit volume (the “concen- 
tration” of the molecules) and to the mean value of the speed of the 
molecules (and not their velocity — the mean value of the velocity 
vector of the molecules for equilibrium of a gas is zero). We must
note that the quantity $\nu$ in Eq. (11.23) is the density of the stream
of molecules striking the wall.

Let us consider an imaginary unit area in a gas. If the gas is in
equilibrium, the same number of molecules will fly through this area
in both directions on an average. The number of molecules flying in
each direction in unit time is also determined by Eq. (11.23).
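
The factor 1/4 in Eq. (11.23) can be checked by a random experiment. The following sketch rests on several assumptions of ours: the speeds are drawn from an arbitrary (hypothetical) uniform distribution, and isotropic directions are generated by taking $\cos\theta$ uniformly between $-1$ and $1$, which follows from Eq. (11.15) because $\sin\theta\,d\theta = -d(\cos\theta)$. A molecule moving toward the wall contributes to the stream in proportion to $v\cos\theta$, the altitude of the oblique cylinder per unit time.

```python
import random

rng = random.Random(3)
N = 200_000

speeds = [rng.uniform(100.0, 900.0) for _ in range(N)]   # hypothetical speeds, m/s
cosines = [rng.uniform(-1.0, 1.0) for _ in range(N)]     # cos(theta), isotropic directions

mean_speed = sum(speeds) / N

# Stream density divided by the concentration n: only molecules moving
# toward the wall (cos(theta) > 0) are counted, each with the weight v*cos(theta).
flux_per_n = sum(v * c for v, c in zip(speeds, cosines) if c > 0.0) / N

print(flux_per_n, mean_speed / 4.0)   # the two numbers should nearly coincide
```

Multiplying `flux_per_n` by the concentration $n$ gives an estimate of $\nu$; the result does not depend on the particular speed distribution assumed here, only on its mean.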

Equation (11.23) can be obtained with an accuracy up to the numerical
coefficient with the aid of the following simplified reasoning.
Let us assume that the gas molecules travel only in three
mutually perpendicular directions. If our vessel contains $N$ molecules,
then at any moment $N/3$ molecules will travel in each of these
directions. One half of them (i.e. $N/6$ molecules) will travel in a
given direction to one side, and the other half to the other side.
Hence, 1/6 of the molecules travel in the direction we are interested
in (for example, along a normal to the given element $\Delta S$ of the
vessel’s wall).

Let us also assume that all the molecules travel with the same
speed equal to $\langle v \rangle$. Therefore, during the time $\Delta t$, the wall element
$\Delta S$ will be reached by all the molecules moving toward it that are
inside a cylinder with the base $\Delta S$ and the altitude $\langle v \rangle \Delta t$ (Fig. 11.6).
The number of these molecules is $\Delta\nu = (n/6)\,\Delta S\,\langle v \rangle\,\Delta t$. Accordingly,
the number of collisions with a unit area in unit time will be

$$\nu = \frac{1}{6} n \langle v \rangle \tag{11.24}$$

The expression obtained differs from Eq. (11.23) only in the value
of the numerical factor (1/6 instead of 1/4).

Retaining our assumption on the motion of the molecules in three 
mutually perpendicular directions, but negating the assumption on 
the molecules having identical speeds, we must separate from among 
the molecules in unit volume those dn v molecules whose speed ranges 
from $v$ to $v + dv$. The number of molecules having such speeds and
reaching the area $\Delta S$ during the time $\Delta t$ is

$$d\nu_v = \frac{1}{6}\,dn_v\,\Delta S\,v\,\Delta t \tag{11.25}$$

We get the total number of collisions by integrating Eq. (11.25)
with respect to speeds:

$$\Delta\nu = \int d\nu_v = \frac{1}{6}\,\Delta S\,\Delta t \int_0^{v_{\max}} v\,dn_v = \frac{1}{6}\,\Delta S\,\Delta t\,n \langle v \rangle$$

Finally, dividing $\Delta\nu$ by $\Delta S$ and $\Delta t$, we get Eq. (11.24). Thus, our
assumption that the molecules have identical speeds does not affect
the result obtained for the number of collisions of the molecules
with the wall. As we shall see in the following section, however, this
assumption changes the result of pressure calculations.
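
The simplified model with three mutually perpendicular directions is easy to imitate. In the sketch below, which is only an illustration, every molecule is assigned one of six directions at random, and the stream toward the wall is computed once for identical speeds and once for distributed speeds with the same mean. The speed values, the sample size, and the function name are hypothetical.

```python
import random

rng = random.Random(4)
N = 300_000
DIRECTIONS = ["+x", "-x", "+y", "-y", "+z", "-z"]   # the six allowed directions


def flux_per_unit_concentration(speeds, rng):
    """Mean of v over all molecules, counting only those heading along +z (toward the wall)."""
    total = 0.0
    for v in speeds:
        if rng.choice(DIRECTIONS) == "+z":
            total += v
    return total / len(speeds)


same = [500.0] * N                                       # identical speeds (hypothetical, m/s)
spread = [rng.uniform(100.0, 900.0) for _ in range(N)]   # distributed speeds, same mean

flux_same = flux_per_unit_concentration(same, rng)
flux_spread = flux_per_unit_concentration(spread, rng)

print(flux_same, sum(same) / N / 6.0)
print(flux_spread, sum(spread) / N / 6.0)
```

In both cases the stream per unit concentration comes out close to one-sixth of the mean speed, as Eq. (11.24) states.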


11.4. Pressure of a Gas on a Wall 


The walls of a vessel containing a gas are continuously bombarded
by its molecules. The result is that the wall element $\Delta S$ receives
a momentum during one second that equals the force acting on this
element. The ratio of this force to the area $\Delta S$ gives the pressure
exerted by the gas on the walls of the vessel. The pressure of
the gas on different portions of the vessel walls is the same owing
to the chaotic nature of motion of the molecules (naturally, provided
that the gas is in an equilibrium state).

If we assume that the molecules rebound from a wall according to
the law of mirror reflection ($\theta_{\text{refl}} = \theta_{\text{inc}}$) and the magnitude of
the velocity of a molecule does not change*, then the momentum
imparted by a molecule to the wall upon colliding with it will be
$2mv \cos\theta$ (Fig. 11.7), where $m$ is the mass of a molecule. This momentum
is directed along a normal to the area. Each of the $d\nu_{v,\theta,\varphi}$
molecules [see Eq. (11.19)] imparts a momentum of $2mv \cos\theta$ to
the wall, and all these molecules impart a momentum of

$$dK_{v,\theta,\varphi} = 2mv \cos\theta\,d\nu_{v,\theta,\varphi} = dN_v \frac{d\Omega_{\theta,\varphi}}{4\pi} \frac{2mv^2 \cos^2\theta\,\Delta S\,\Delta t}{V}$$

(We have used the symbol $K$ for the momentum instead of the previously
used symbol $p$ to avoid confusion: the latter symbol stands
for pressure here.)

* The interaction of the molecules with the walls of the vessel is actually of
a more complicated nature (see p. 423), and our assumptions hold only on the
average for a great number of collisions.

Summation of the expression obtained by directions within the
limits of the solid angle $2\pi$ (corresponding to changes in $\theta$ from 0
to $\pi/2$ and changes in $\varphi$ from 0 to $2\pi$) gives the momentum imparted
by the molecules whose velocities have magnitudes ranging from
$v$ to $v + dv$:

$$dK_v = dN_v \frac{2mv^2\,\Delta S\,\Delta t}{4\pi V} \int_0^{\pi/2} \cos^2\theta \sin\theta\,d\theta \int_0^{2\pi} d\varphi$$

[we have introduced Eq. (11.15) for $d\Omega$]. Integration with respect
to $d\varphi$ yields $2\pi$, and the integral with respect to $d\theta$ is 1/3. Hence,

$$dK_v = dN_v \frac{mv^2\,\Delta S\,\Delta t}{3V}$$

Integrating this expression with respect to velocities from 0 to
$v_{\max}$, we get the total momentum imparted to the area $\Delta S$ during
the time $\Delta t$:

$$\Delta K = \frac{m\,\Delta S\,\Delta t}{3V} \int_0^{v_{\max}} v^2\,dN_v \tag{11.26}$$


The expression

$$\frac{1}{N} \int_0^{v_{\max}} v^2\,dN_v$$

is the mean value of the square of the velocity of the molecules.
Substituting the product $N \langle v^2 \rangle$ for the integral in Eq. (11.26), we
find that

$$\Delta K = \frac{m\,\Delta S\,\Delta t}{3V} N \langle v^2 \rangle = \frac{1}{3} n m \langle v^2 \rangle\,\Delta S\,\Delta t$$

($n = N/V$ is the number of molecules in unit volume). Finally,
dividing this expression by $\Delta S$ and $\Delta t$, we obtain the pressure of a
gas on the walls of the vessel containing it:

$$p = \frac{1}{3} n m \langle v^2 \rangle = \frac{2}{3} n \frac{m \langle v^2 \rangle}{2} \tag{11.27}$$

We have assumed that all the molecules have the same mass. We
can therefore put it inside the sign of the mean quantity. As a result,
Eq. (11.27) acquires the form

$$p = \frac{2}{3} n \left\langle \frac{mv^2}{2} \right\rangle = \frac{2}{3} n \langle \varepsilon_{\text{tr}} \rangle \tag{11.28}$$

where $\langle \varepsilon_{\text{tr}} \rangle$ is the mean value of the kinetic energy of translation
of the molecules.
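
The pressure formula can be checked in the same spirit as the number of collisions. The sketch below is a rough illustration under our own assumptions: the molecular mass is taken roughly equal to that of a nitrogen molecule, the speeds follow an arbitrary uniform distribution, and isotropic directions are generated by drawing $\cos\theta$ uniformly between $-1$ and $1$. A molecule moving toward the wall arrives at a rate proportional to $v\cos\theta$ and delivers the momentum $2mv\cos\theta$, so its contribution to the pressure is proportional to $2mv^2\cos^2\theta$.

```python
import random

rng = random.Random(5)
N = 200_000
m = 4.65e-26   # kg; roughly the mass of an N2 molecule (illustrative value)

speeds = [rng.uniform(100.0, 900.0) for _ in range(N)]   # hypothetical speeds, m/s
cosines = [rng.uniform(-1.0, 1.0) for _ in range(N)]     # cos(theta), isotropic directions

mean_v2 = sum(v * v for v in speeds) / N

# Momentum delivered to unit area in unit time, divided by the concentration n.
p_over_n = sum(2.0 * m * v * v * c * c
               for v, c in zip(speeds, cosines) if c > 0.0) / N

print(p_over_n, m * mean_v2 / 3.0)                    # Eq. (11.27) divided by n
print(p_over_n, (2.0 / 3.0) * (0.5 * m * mean_v2))    # Eq. (11.28) divided by n
```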

Let us obtain an expression for the pressure proceeding from the
simplified notions that led us to Eq. (11.24). According to these
notions, each molecule imparts a momentum of $2m \langle v \rangle$ to the wall
it collides with. Multiplying this momentum by the number of
collisions [see Eq. (11.24)], we get the momentum imparted to a
unit area in unit time, i.e. the pressure. We thus obtain the equation

$$p = \frac{1}{6} n \langle v \rangle \cdot 2m \langle v \rangle = \frac{1}{3} n m \langle v \rangle^2 \tag{11.29}$$

This equation differs from Eq. (11.27) in that it contains the square
of the mean velocity $\langle v \rangle^2$ instead of the mean square of the velocity
$\langle v^2 \rangle$. We shall see on a later page (in Sec. 11.5) that these two
quantities differ from each other, i.e. $\langle v^2 \rangle \neq \langle v \rangle^2$.

In a more accurate calculation, we must multiply the number of
molecules determined according to Eq. (11.25) by $2mv$ and then
summate over all the $v$'s. As a result, we get the momentum imparted
to the area $\Delta S$ during the time $\Delta t$:

$$\Delta K = \int_0^{v_{\max}} \frac{1}{6}\,dn_v\,\Delta S\,v\,\Delta t \cdot 2mv = \frac{1}{3}\,\Delta S\,\Delta t\,m \int_0^{v_{\max}} v^2\,dn_v = \frac{1}{3}\,\Delta S\,\Delta t\,n m \langle v^2 \rangle$$

Dividing this equation by $\Delta S$ and $\Delta t$, we get Eq. (11.27) for the
pressure. Thus, on the basis of our simplified notion of the molecules
travelling in three mutually perpendicular directions, we have
obtained an exact expression for the pressure. The explanation is
that this simplification leads on the one hand to diminishing of the
number of collisions of the molecules with the wall [$\frac{1}{6} n \langle v \rangle$ instead
of $\frac{1}{4} n \langle v \rangle$, see Eqs. (11.23) and (11.24)], and on the other to
overstating of the momentum transmitted to the wall in each collision.
In our simplified derivation, we assumed that the wall receives
a momentum of $2mv$ upon each collision. Actually, however, the
magnitude of the momentum imparted to the wall depends on the
angle $\theta$, and as a result the mean momentum imparted in one collision
is $\frac{4}{3} mv$. In the long run, both inaccuracies mutually compensate
each other and, notwithstanding the simplified nature of our
derivation, we obtain an exact expression for the pressure.
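
The compensation of the two inaccuracies can be verified with a short calculation. In the sketch below (an illustration with helper names of our own) the mean momentum per collision is found for molecules of one speed $v$ by weighting the momentum $2mv\cos\theta$ with the collision rate, which is proportional to $\cos\theta\sin\theta\,d\theta$; the result is expressed in units of $mv$. The product of the collision coefficient and the momentum per collision is then compared for the simplified and the exact treatment.

```python
import math
from fractions import Fraction


def integrate(g, lo, hi, steps=100_000):
    """Midpoint-rule estimate of the integral of g from lo to hi."""
    h = (hi - lo) / steps
    return h * sum(g(lo + (k + 0.5) * h) for k in range(steps))


# Collision rate for the direction theta is proportional to cos(theta) sin(theta) dtheta.
rate = integrate(lambda t: math.cos(t) * math.sin(t), 0.0, math.pi / 2)

# Momentum delivered, in units of m*v, weighted with the same collision rate.
momentum = integrate(lambda t: 2.0 * math.cos(t) ** 2 * math.sin(t), 0.0, math.pi / 2)

mean_momentum_per_collision = momentum / rate   # in units of m*v
print(mean_momentum_per_collision)

simplified = Fraction(1, 6) * 2              # (1/6) n v collisions, 2 m v in each
exact = Fraction(1, 4) * Fraction(4, 3)      # (1/4) n v collisions, (4/3) m v in each
print(simplified, exact, simplified == exact)
```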



11.5. Mean Energy of Molecules 

Let us write Eq. (11.28) for the pressure obtained in the preceding
section and the equation of state (10.21) of an ideal gas next to each
other:

$$p = \frac{2}{3} n \langle \varepsilon_{\text{tr}} \rangle; \qquad p = nkT$$

A comparison of these equations shows that

$$\langle \varepsilon_{\text{tr}} \rangle = \frac{3}{2} kT \tag{11.30}$$

We have thus arrived at an important conclusion: the absolute tem- 
perature is a quantity proportional to the mean energy of translation 
of molecules. Only gas molecules have translation. For liquids and 
solids, the mean energy of the molecules is proportional to the abso- 
lute temperature only when the motion of the molecules can be 
treated classically. In the quantum region, the mean energy of the 
molecules stops being proportional to the absolute temperature. 

Equation (11.30) is remarkable in that the mean energy is found 
to depend only on the temperature and is independent of the mass 
of a molecule. 

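Since $\langle \varepsilon_{\text{tr}} \rangle = \langle mv^2/2 \rangle = (m/2) \langle v^2 \rangle$, it follows from Eq. (11.30)
that

$$\langle v^2 \rangle = \frac{3kT}{m} \tag{11.31}$$

Representing $v^2$ in the form of the sum of the squares of the velocity
components, we can write:

$$\langle v^2 \rangle = \langle v_x^2 \rangle + \langle v_y^2 \rangle + \langle v_z^2 \rangle$$

Owing to all the directions of motion having equal rights, the following
equation is obeyed:

$$\langle v_x^2 \rangle = \langle v_y^2 \rangle = \langle v_z^2 \rangle$$

Taking this into account, we find that

$$\langle v_x^2 \rangle = \frac{1}{3} \langle v^2 \rangle = \frac{kT}{m} \tag{11.32}$$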
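
Equations (11.30) to (11.32) are readily evaluated for actual gases. The sketch below is a numerical illustration only: the temperature of 300 K and the rounded relative molecular masses of helium and nitrogen are our assumptions, and the function names are hypothetical. The square root of $\langle v^2 \rangle$ is printed to give a feeling for the order of magnitude of molecular speeds.

```python
import math

K_B = 1.380649e-23     # Boltzmann constant, J/K
AMU = 1.66053907e-27   # atomic mass unit, kg


def mean_translational_energy(T):
    """Eq. (11.30): mean energy of translation of one molecule, J."""
    return 1.5 * K_B * T


def mean_square_speed(T, m):
    """Eq. (11.31): mean square of the speed, m^2/s^2."""
    return 3.0 * K_B * T / m


T = 300.0   # K, assumed temperature
for name, relative_mass in [("He", 4.0), ("N2", 28.0)]:   # rounded relative masses
    m = relative_mass * AMU
    v2 = mean_square_speed(T, m)
    # Columns: gas, mean energy (J), square root of <v^2> (m/s), <v_x^2> by Eq. (11.32)
    print(name, mean_translational_energy(T), math.sqrt(v2), v2 / 3.0)
```

The mean energy is the same for both gases, whereas the lighter molecules move considerably faster, in agreement with the remark that the mean energy is independent of the mass of a molecule.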

Equation (11.30) determines the energy of only the translation of 
a molecule. In addition to translation, however, rotation of a mole- 
cule and vibrations of the atoms in the molecule are possible. Both 
these kinds of motion are associated with a certain store of energy. 
The latter can be determined by the theorem on the equal distribution
of the energy by the degrees of freedom of a molecule established
by statistical physics.

The number of degrees of freedom of a mechanical system is defined
as the number of independent quantities by means of which we can set
the position of the system. Thus, the position of a point particle in space
is determined completely by setting the values of three of its coordinates
(for example, the Cartesian coordinates $x$, $y$, $z$, or the spherical
coordinates $r$, $\theta$, $\varphi$, etc.). Accordingly, a point particle has three
degrees of freedom.

The position of a perfectly rigid body can be determined by setting
three coordinates of its centre of mass ($x$, $y$, $z$), the two angles
$\theta$ and $\varphi$ indicating the direction of an axis associated with the
body and passing through its centre of mass (Fig. 11.8), and,
finally, the angle $\psi$ determining the direction of a second axis
associated with the body and perpendicular to the first one.
Hence, a perfectly rigid body has six degrees of freedom. A change
in the coordinates of the centre of mass with the angles $\theta$, $\varphi$, and
$\psi$ remaining constant is due to translation of a rigid body.
Therefore, the relevant degrees of freedom are called translational.
A change in any of the angles $\theta$, $\varphi$, $\psi$ with an unchanging position
of the centre of mass is due to rotation of a body, and in this
connection the corresponding degrees of freedom are called rotational.
Hence, of the six degrees of freedom of a perfectly rigid body,
three are translational and three rotational.

A system of $N$ point particles between which there are no rigid
constraints has $3N$ degrees of freedom (the position of each of the $N$
particles must be set by three coordinates). Any rigid constraint
establishing an unchanging mutual arrangement of two particles
reduces the number of degrees of freedom by one. For example, if a
system consists of two point particles with a constant distance $l$
between them (Fig. 11.9), then the number of degrees of freedom of
the system is five. Indeed, in this case, the following relation holds
between the coordinates of the particles:

$$(x_2 - x_1)^2 + (y_2 - y_1)^2 + (z_2 - z_1)^2 = l^2 \tag{11.33}$$

owing to which the coordinates will not be independent: it is sufficient
to set any five coordinates, and the sixth one will be determined
by condition (11.33). To classify these five degrees of freedom,
we shall note that the position of a system formed by two
rigidly connected point particles can be determined as follows: we
can set the three coordinates of the centre of mass of the system
(Fig. 11.10) and the two angles $\theta$ and $\varphi$ that determine the direction
in space of the axis of the system (i.e. the straight line passing
through both points). It thus follows that three degrees of freedom
will be translational and two rotational. The latter correspond to
rotation about two mutually perpendicular axes O'O' and O''O''
that are at right angles to axis OO of the system (Fig. 11.11). Rotation
about axis OO is deprived of meaning for point particles.

If two point particles are connected by an elastic constraint instead
of a rigid one (i.e. so that any change in the equilibrium distance
$r_0$ between the particles results in the setting up of forces
tending to establish the initial distance between the particles), then
the number of degrees of freedom will be six. The position of the
system in this case can be determined by setting the three coordinates
of the centre of mass (Fig. 11.12), the two angles $\theta$ and $\varphi$, and
the distance $r$ between the particles. Changes in $r$ correspond to
vibrations in the system, consequently this degree of freedom is
called vibrational. Thus, the system considered has three translational,
two rotational, and one vibrational degree of freedom.

Let us consider a system consisting of $N$ point particles elastically
connected to one another. Such a system has $3N$ degrees of freedom.
The particles have an equilibrium configuration corresponding to a
minimum potential energy of the system. The equilibrium configuration
is characterized by quite definite mutual distances between
the particles. If the particles are brought out of their positions
corresponding to the equilibrium configuration, vibrations appear in the
system. The position of the system can be determined by setting the
position of the equilibrium configuration and the quantities characterizing
the displacements of the particles from their equilibrium positions.
The latter quantities correspond to the vibrational degrees of
freedom.

The position of the equilibrium configuration, like that of a perfectly
rigid body, is determined by six quantities, to which three translational
and three rotational degrees of freedom correspond. The number
of vibrational degrees of freedom is thus $3N - 6$*.

Experiments on measuring the heat capacity of gases have shown 
that atoms must be treated as point particles in determining the 
number of degrees of freedom of a molecule. Consequently, three 
translational degrees of freedom should be ascribed to a monatomic 
molecule. The degrees of freedom ascribed to a diatomic molecule 
depend on the nature of the bond between the atoms. They include 
either three translational and two rotational degrees of freedom (with 
a rigid bond), or, apart from these five, another vibrational degree 
of freedom (with an elastic bond). A triatomic molecule with rigid 
bonds has three translational and three rotational degrees of free- 
dom, etc. 
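
The counting rules stated above can be collected in a small function. The sketch below is ours and treats the atoms as point particles, as the text prescribes; the function name and its parameters `rigid` and `linear` are hypothetical. The case of atoms lying on one straight line, with two rotational degrees of freedom, is included as an option.

```python
def degrees_of_freedom(n_atoms, rigid=True, linear=False):
    """Return (translational, rotational, vibrational) for a molecule of point atoms.

    rigid  -- True for rigid bonds, False for elastic bonds
    linear -- True if the equilibrium positions lie on one straight line
    """
    if n_atoms < 1:
        raise ValueError("a molecule must contain at least one atom")
    if n_atoms == 1:
        return 3, 0, 0
    rotational = 2 if (linear or n_atoms == 2) else 3
    vibrational = 0 if rigid else 3 * n_atoms - 3 - rotational
    return 3, rotational, vibrational


print(degrees_of_freedom(1))                 # monatomic molecule
print(degrees_of_freedom(2))                 # diatomic molecule, rigid bond
print(degrees_of_freedom(2, rigid=False))    # diatomic molecule, elastic bond
print(degrees_of_freedom(3))                 # triatomic molecule, rigid bonds
print(degrees_of_freedom(3, rigid=False))    # 3N - 6 vibrational degrees of freedom
```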

We must note that no matter how many degrees of freedom a mole- 
cule has, three of them are translational. Since none of the transla- 
tional degrees of freedom of a molecule has priority over the other 
two, an identical energy should fall to each of them on an average. 

This energy is one-third of the value given by Eq. (11.30), i.e. $\frac{1}{2} kT$.

The equipartition principle is derived in classical statistical
physics**. It states that an identical kinetic energy equal to $\frac{1}{2} kT$
resides on the average in any degree of freedom.

According to this principle, the mean energy of one molecule
$\langle \varepsilon \rangle$ will be the greater (at the same temperature), the more complex
the molecule is and the more degrees of freedom it has. In determining
$\langle \varepsilon \rangle$, we must take into account that a vibrational degree of
freedom must have an energy capacity that is twice the value for a
translational or rotational one. The explanation is that translation
and rotation of a molecule are associated with the presence of only
kinetic energy, whereas vibration is associated with the presence of
both kinetic and potential energy; for a harmonic oscillator, the
mean value of the kinetic and the potential energy is the same. Hence,
two halves of $kT$ must reside in each vibrational degree of freedom:
one in the form of kinetic energy and one in the form of potential
energy.

* It is assumed that the equilibrium positions of the particles are not on
one straight line. Otherwise there will be only two rotational degrees of freedom,
and $3N - 5$ vibrational ones. We treated this case in dealing with a system
consisting of two particles.

** This derivation is beyond the scope of a course in general physics.

Thus, the mean energy of a molecule should be

$$\langle \varepsilon \rangle = \frac{i}{2} kT \tag{11.34}$$

where $i$ is the sum of the number of translational, the number of
rotational, and twice the number of vibrational degrees of freedom
of a molecule:

$$i = n_{\text{tr}} + n_{\text{rot}} + 2n_{\text{vib}} \tag{11.35}$$

For molecules with a rigid bond between their atoms, $i$ coincides
with the number of degrees of freedom of a molecule.

Molecules of an ideal gas do not interact with one another. We
can therefore find the internal energy of one mole of an ideal gas by
multiplying the Avogadro constant by the mean energy of one
molecule:

$$U_m = N_A \langle \varepsilon \rangle = \frac{i}{2} N_A kT = \frac{i}{2} RT \tag{11.36}$$

A comparison of this equation with Eq. (10.28) shows that

$$C_V = \frac{i}{2} R \tag{11.37}$$

With a view to Eq. (10.33), we find that

$$C_p = \frac{i + 2}{2} R \tag{11.38}$$

Hence,

$$\gamma = \frac{C_p}{C_V} = \frac{i + 2}{i} \tag{11.39}$$

Thus, the quantity $\gamma$ is determined by the number and the nature
of degrees of freedom of a molecule.
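
Equations (11.35) and (11.37) to (11.39) can be evaluated for the simplest kinds of molecules in a few lines. The sketch below is a minimal illustration; the function names are ours, the heat capacities are kept as exact fractions of $R$, and a rounded value of the molar gas constant is assumed for the conversion to J/(mol K).

```python
from fractions import Fraction

R = 8.314   # molar gas constant, J/(mol K), rounded


def effective_i(n_tr, n_rot, n_vib):
    """Eq. (11.35): a vibrational degree of freedom is counted twice."""
    return n_tr + n_rot + 2 * n_vib


def heat_capacities(i):
    """Return (C_V / R, C_p / R, gamma) according to Eqs. (11.37) to (11.39)."""
    c_v = Fraction(i, 2)
    c_p = Fraction(i + 2, 2)
    gamma = Fraction(i + 2, i)
    return c_v, c_p, gamma


cases = {
    "monatomic": (3, 0, 0),
    "diatomic, rigid bond": (3, 2, 0),
    "diatomic, elastic bond": (3, 2, 1),
    "three or more atoms, rigid bonds": (3, 3, 0),
}

for name, dof in cases.items():
    i = effective_i(*dof)
    c_v, c_p, gamma = heat_capacities(i)
    # Columns: molecule, i, C_V/R, C_p/R, gamma, C_V in J/(mol K)
    print(name, i, c_v, c_p, round(float(gamma), 2), round(float(c_v) * R, 1))
```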

Table 11.1 gives the values of $C_V$, $C_p$, and $\gamma$ obtained for different
species of molecules by Eqs. (11.37), (11.38), and (11.39). Table 11.2
compares the theoretical results with experimental data. The theoretical
values have been obtained (except for the case indicated in the
footnote to the table) on the assumption that the molecules are rigid;
the experimental values have been obtained for temperatures close
to room temperature.

Table 11.1

                           Nature of     Number of degrees of freedom
Molecule                   interatomic   translational  rotational  vibrational   i   C_V      C_p      γ
                           bond
Monatomic                  -             3              -           -             3   (3/2)R   (5/2)R   1.67
Diatomic                   Rigid         3              2           -             5   (5/2)R   (7/2)R   1.40
Diatomic                   Elastic       3              2           1             7   (7/2)R   (9/2)R   1.29
With three or more atoms   Rigid         3              3           -             6   3R       4R       1.33


It would seem to follow from Table 11.2 that agreement between
theory and experiment is quite satisfactory, at any rate for monatomic
and diatomic molecules. Actually, however, matters are different.
According to the theory we have considered, the heat capacities of gases
ought to be integral multiples of R/2 because the number of degrees
of freedom can only be integral. Therefore, even small deviations of
C_V and C_p from values that are multiples of R/2 have fundamental
significance. Examination of the table shows that such deviations,
exceeding the possible errors of measurement, are encountered.

Table 11.2

Gas                    Number of   C_V × 10⁻³          C_p × 10⁻³          γ
                       atoms in    theor.   exper.     theor.   exper.     theor.   exper.
                       molecule

Helium (He)            1           12.5     12.5       20.8     20.9       1.67     1.67
Oxygen (O₂)            2           20.8     20.9       29.1     28.9       1.40     1.40
Carbon monoxide (CO)   2           20.8     21.0       29.1     29.3       1.40     1.40
Water vapour (H₂O)     3           25.0     27.8       33.2     36.2       1.33     1.31
                                   33.2*               41.5*               1.25*

* For i = 8, i.e. assuming that there is additionally one vibrational degree of freedom.

The discrepancies between theory and experiment become especially
striking if we turn to the temperature dependence of the heat
capacity. Figure 11.13 shows a curve of the temperature dependence
of the molar heat capacity C_V obtained experimentally for hydrogen.
The heat capacity should be independent of the temperature according
to theory. A glance at the figure shows that this holds only
within the limits of separate temperature intervals, and that within
different intervals the heat capacity has values corresponding to
different numbers of degrees of freedom of a molecule. Thus, on
portion 1-1', we have C_V = (3/2)R. This signifies that a molecule behaves
like a system having only translational degrees of freedom. On
portion 2-2', we have C_V = (5/2)R. Hence, at temperatures corresponding
to this portion of the curve, in addition to the three translational
degrees of freedom manifesting themselves at lower temperatures,
two rotational ones appear in a molecule. Finally, at sufficiently
high temperatures, C_V becomes equal to (7/2)R, which points
to the presence of vibrations of a molecule at these temperatures.
Between these intervals, the heat capacity grows monotonically
with increasing temperature, i.e. corresponds, as it were, to a
fractional varying number of degrees of freedom.

Fig. 11.13 (C_V of hydrogen plotted against temperature)

Thus, the number of degrees of freedom of a molecule manifesting 
itself in the heat capacity depends on the temperature. At low tem- 
peratures, only translation of the molecules is observed. At higher 
temperatures, rotation of the molecules is observed in addition to 
translation. And, finally, at still higher temperatures, vibrations of 
the molecules are also added to the first two kinds of motion. As
the gradual, monotonic rise of the heat capacity curve indicates, the
molecules do not all become involved in rotation, and then in
vibration, at once. At first, rotation, for example, is observed only in
a small fraction of the molecules. This fraction grows as the
temperature rises, and once a definite temperature
is reached, virtually all the molecules are involved in rotation.

Matters are similar for vibration of the molecules.

This behaviour of the heat capacity is explained by quantum mechanics.
The quantum theory has established that the energy of rotation and
vibration of molecules is quantized. This signifies that the energy
of rotation and that of vibration of a molecule cannot have any
values, but only discrete ones (i.e. values differing from one
another by a finite amount). Consequently, the energy associated with
these kinds of motion can change only in jumps. Such restrictions do
not exist for the energy of translation.

The intervals between separate allowed values of the energy (or, in
accordance with the adopted terminology, between energy levels) are
about an order of magnitude greater for vibration than for rotation.
A simplified diagram of the rotational and vibrational levels of a
diatomic molecule is given in Fig. 11.14. (The distances between the
rotational levels are actually not the same, but this is of no
significance for the question being considered.)

Fig. 11.14 (rotational and vibrational energy levels; dashed lines
mark ⟨ε⟩ at low, medium, and high temperature)

We noted in Sec. 11.2 that the velocities of molecules are mainly
grouped near a most probable value. Accordingly, the predominating
part of molecules have energies close to the mean value ⟨ε⟩, and
only a small part of the molecules have energies considerably
exceeding ⟨ε⟩. Hence, for an appreciable part of the molecules to be
involved in rotation or vibration, their mean energy must be
sufficiently high in comparison with the distance between the allowed
levels of the relevant energy.

Let us take such a low temperature that the mean energy of a
molecule ⟨ε⟩ is considerably lower than the first allowed value of
the rotational energy (see the bottom dashed line in Fig. 11.14). Now
only an insignificant part of all the molecules will be involved in
rotation, so that the molecules of the gas will virtually have only
translation. Small changes in the temperature will result in changes
only in the energy of translation, and the heat capacity of the gas
will accordingly be (3/2)R (see portion 1-1' on the curve depicted in
Fig. 11.13).

Elevation of the temperature is attended by an increase in ⟨ε⟩
so that a constantly growing part of the molecules will be involved
in rotation. Portion 1'-2 of the curve in Fig. 11.13 corresponds to
this process.

After all the molecules begin to participate in rotation, the
horizontal portion 2-2' commences. At temperatures corresponding to
it, the value of ⟨ε⟩ is still considerably lower than the distance
between the allowed levels of vibrational energy. As a result, vibration
of the molecules will virtually be absent. With a further elevation
of the temperature, the molecules will begin to vibrate in greater
and greater numbers; the transition portion 2'-3 on the heat
capacity curve corresponds to this process. Finally, at a sufficiently
high temperature, all the molecules will be involved in vibration, and
the heat capacity will become equal to (7/2)R.

The classical theory of heat capacity is thus approximately cor- 
rect only for separate temperature intervals. A different number of 
degrees of freedom of a molecule corresponds to each interval. 


11.6. The Maxwell Distribution 

We shall use the following procedure to find a way of quantitatively
describing the distribution of molecules by velocity magnitudes.
Let us take Cartesian coordinate axes in an imaginary space which
we shall call v-space (velocity space). We shall lay off the values of
v_x, v_y, v_z of individual molecules along these axes (what we have
in view are the velocity components along the axes x, y, z taken
in conventional space). Hence, a point in this v-space will correspond
to the velocity of each molecule. Owing to collisions, the
positions of the points will continuously change, but their density
at each place will remain unchanged (we remind our reader that
we are dealing with an equilibrium state of a gas).

Since all directions of motion are equally probable, the
arrangement of the points relative to the origin of coordinates will
be spherically symmetrical. Hence, the density of the points in
our v-space can depend only on the magnitude of the velocity v
(or on v²). Let us denote this density by N f(v) (here N is the total
number of molecules in the given mass of gas). Hence, the number of
molecules whose velocity components are within the limits from
v_x to v_x + dv_x, from v_y to v_y + dv_y, and from v_z to v_z + dv_z can
be written in the form

    dN_{v_x, v_y, v_z} = N f(v) \, dv_x \, dv_y \, dv_z    (11.40)

(the product dv_x dv_y dv_z gives an element of volume in v-space).

Points depicting velocities whose magnitude is confined within
the limits from v to v + dv are in the region between spheres of
radii v and v + dv (Fig. 11.15). The volume of this region is 4πv² dv.
Hence, the number of points in this region is determined by the
expression

    dN_v = N f(v) \, 4\pi v^2 \, dv    (11.41)

This equation gives the number of molecules with velocity magnitudes
ranging from v to v + dv. Dividing it by N, we get the probability
dP_v of the velocity of a molecule being within the limits from v
to v + dv:

    dP_v = f(v) \, 4\pi v^2 \, dv    (11.42)

By comparing this expression with Eq. (11.6), we conclude that

    F(v) = f(v) \, 4\pi v^2    (11.43)

plays the part of a distribution or partition function of the molecules
of a gas by velocities.

The form of function (11.43) 
was established theoretically in 
1860 by James Maxwell (1831-1879). 
We approximately follow his reasoning in the derivation of the law 
of distribution of gas molecules by velocities set out below. 

According to Eq. (11.6), the probability of the velocity component
v_x of a molecule having a value within the limits from v_x to
v_x + dv_x can be written in the form

    dP_{v_x} = \varphi(v_x) \, dv_x    (11.44)

where φ(v_x) is a distribution function. The similar probabilities for
the other two components are determined by the equations

    dP_{v_y} = \varphi(v_y) \, dv_y    (11.45)

    dP_{v_z} = \varphi(v_z) \, dv_z    (11.46)

Since all directions of motion are equally probable, the analytical
form of the functions φ(v_x), φ(v_y), and φ(v_z) must be identical.
These functions differ only in the designation of the argument.

Maxwell assumed that the probability of one of the components,
for instance v_x, having different values does not depend on the
values of the other two components (in our case v_y and v_z)*. This
signifies that the events consisting in that v_x of a molecule is within
the limits from v_x to v_x + dv_x, v_y of the same molecule is within
the limits from v_y to v_y + dv_y, and, finally, v_z of the same molecule
is within the limits from v_z to v_z + dv_z, are statistically
independent. Therefore the probability that the velocity components of a
molecule have values within the limits from v_x, v_y, v_z to v_x + dv_x,
v_y + dv_y, v_z + dv_z equals the product of the probabilities given
by Eqs. (11.44)-(11.46):

* This assumption can be proved very strictly, but the proof is beyond the
scope of our course.

    dP_{v_x, v_y, v_z} = \varphi(v_x) \, \varphi(v_y) \, \varphi(v_z) \, dv_x \, dv_y \, dv_z    (11.47)

[see Eq. (11.4)]. At the same time, according to Eq. (11.40), this
probability can be written in the form

    dP_{v_x, v_y, v_z} = f(v) \, dv_x \, dv_y \, dv_z    (11.48)

A comparison of Eqs. (11.47) and (11.48) shows that

    f(v) = \varphi(v_x) \, \varphi(v_y) \, \varphi(v_z)    (11.49)

Taking logarithms of both sides of this equation, we get

    \ln f(v) = \ln \varphi(v_x) + \ln \varphi(v_y) + \ln \varphi(v_z)

Differentiation of this expression with respect to v_x yields

    \frac{f'(v)}{f(v)} \frac{\partial v}{\partial v_x} = \frac{\varphi'(v_x)}{\varphi(v_x)}    (11.50)

Since v = \sqrt{v_x^2 + v_y^2 + v_z^2}, the partial derivative of v with
respect to v_x is

    \frac{\partial v}{\partial v_x} = \frac{v_x}{\sqrt{v_x^2 + v_y^2 + v_z^2}} = \frac{v_x}{v}

Introducing this value of the derivative into Eq. (11.50) and then
transferring v_x from the numerator of the left-hand side to the
denominator of the right-hand one, we get the equation

    \frac{f'(v)}{f(v)} \frac{1}{v} = \frac{\varphi'(v_x)}{\varphi(v_x)} \frac{1}{v_x}    (11.51)

The right-hand side of this equation, and therefore its left-hand
side, is independent of the variables v_y and v_z. Consequently, it
also cannot depend on v_x [the quantities v_x, v_y, and v_z in f(v) are
symmetrical, see Eq. (11.49)]. Thus, each of the expressions in the
left-hand and right-hand sides of Eq. (11.51) equals a certain
constant which we shall denote by −α (we shall see later that this
constant is less than zero, i.e. α > 0).

Hence,

    \frac{\varphi'(v_x)}{\varphi(v_x)} \frac{1}{v_x} = -\alpha,  or  \frac{\varphi'(v_x)}{\varphi(v_x)} = -\alpha v_x

Integration yields

    \ln \varphi(v_x) = -\frac{\alpha v_x^2}{2} + \ln A

where A is a constant. Thus,

    \varphi(v_x) = A \exp\left(-\frac{\alpha v_x^2}{2}\right)    (11.52)

Similarly,

    \varphi(v_y) = A \exp\left(-\frac{\alpha v_y^2}{2}\right),   \varphi(v_z) = A \exp\left(-\frac{\alpha v_z^2}{2}\right)

Multiplication of these functions yields

    f(v) = A^3 \exp\left[-\frac{\alpha (v_x^2 + v_y^2 + v_z^2)}{2}\right] = A^3 \exp\left(-\frac{\alpha v^2}{2}\right)    (11.53)

It can be seen from the form of functions (11.52) and (11.53) that
the constant α must be greater than zero. If it were negative, these
functions would grow without restriction with increasing v.

The constant A is found from the normalization condition (11.7).
According to this condition,

    A \int_{-\infty}^{+\infty} \exp\left(-\frac{\alpha v_x^2}{2}\right) dv_x = 1    (11.54)

We pointed out in Sec. 11.2 that the values of v (and, consequently,
v_x too) cannot exceed a certain very great, but finite value v_max.
At the same time, we have taken −∞ and +∞ as the integration
limits. This extension of the integration limits does not introduce
an appreciable error. The integrand diminishes with increasing v_x
so rapidly that at sufficiently great values of v_x it virtually does
not differ from zero. Therefore, the contribution of the integration
paths from v_max to ∞ and from −v_max to −∞ is negligibly small.

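The integral in Eq. (11.54) is a Poisson integral with β = α/2 [see
Appendix 2, Eq. (A.1)]. According to Eq. (A.3), we have

    \int_{-\infty}^{+\infty} \exp\left(-\frac{\alpha v_x^2}{2}\right) dv_x = \sqrt{\frac{\pi}{\alpha/2}} = \sqrt{\frac{2\pi}{\alpha}}    (11.55)

Introducing this value into Eq. (11.54), we find that A \sqrt{2\pi/\alpha} = 1.
Hence,

    A = \sqrt{\frac{\alpha}{2\pi}}    (11.56)

Using the found value of A in Eqs. (11.52) and (11.53), we arrive
at the equations

    \varphi(v_x) = \sqrt{\frac{\alpha}{2\pi}} \exp\left(-\frac{\alpha v_x^2}{2}\right)    (11.57)

    f(v) = \left(\frac{\alpha}{2\pi}\right)^{3/2} \exp\left(-\frac{\alpha v^2}{2}\right)    (11.58)

To find the constant α, we shall use the function (11.57) to
calculate the value of ⟨v_x²⟩ and equate the expression obtained to the
value of kT/m found by calculating the pressure [see Eq. (11.32)].
In accordance with Eq. (11.11)

    \langle v_x^2 \rangle = \int_{-\infty}^{+\infty} v_x^2 \varphi(v_x) \, dv_x = \sqrt{\frac{\alpha}{2\pi}} \int_{-\infty}^{+\infty} \exp\left(-\frac{\alpha v_x^2}{2}\right) v_x^2 \, dv_x    (11.59)

According to Eq. (A.4), we have

    \int_{-\infty}^{+\infty} \exp\left(-\frac{\alpha v_x^2}{2}\right) v_x^2 \, dv_x = \frac{1}{2} \sqrt{\frac{\pi}{(\alpha/2)^3}}    (11.60)

Substituting for the integral in Eq. (11.59) its value from Eq.
(11.60), we find that

    \langle v_x^2 \rangle = \frac{1}{\alpha}

Comparison with Eq. (11.32) yields

    \alpha = \frac{m}{kT}    (11.61)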


The use of this value in Eqs. (11.57) and (11.58) leads to the final
expressions for the distribution functions:

    \varphi(v_x) = \left(\frac{m}{2\pi kT}\right)^{1/2} \exp\left(-\frac{m v_x^2}{2kT}\right)    (11.62)

    f(v) = \left(\frac{m}{2\pi kT}\right)^{3/2} \exp\left(-\frac{m v^2}{2kT}\right)    (11.63)

It must be remembered that function (11.63) when multiplied by
N determines the density of the points depicting the velocities of
the molecules in v-space. Multiplication of this function by dv_x dv_y dv_z
gives the probability dP_{v_x, v_y, v_z} of the velocity components being
within the limits from v_x, v_y, v_z to v_x + dv_x, v_y + dv_y, v_z + dv_z.
Not only the magnitude of the velocity, but also its direction vary
only within small limits determined by dv_x, dv_y, and dv_z. If we
are interested only in the probability of the magnitude of the velocity
regardless of the direction of motion of a molecule, i.e. dP_v, then
we must take the distribution function in the form of Eq. (11.43).
Multiplication of this function by dv gives the probability of the
velocity of a molecule having the magnitude (with an arbitrary
direction of motion) within the limits from v to v + dv.

According to Eqs. (11.43) and (11.63), we have

    F(v) = \left(\frac{m}{2\pi kT}\right)^{3/2} \exp\left(-\frac{m v^2}{2kT}\right) 4\pi v^2    (11.64)

This function is characterized by the exponent containing the
negative ratio between the kinetic energy of a molecule corresponding
to the velocity v being considered and kT, i.e. a quantity
characterizing the mean energy of the molecules of a gas.

Fig. 11.16        Fig. 11.17

A graph of function (11.62) is shown in Fig. 11.16. It coincides 
with the Gaussian distribution of a random quantity. 

A graph of function (11.64) is shown in Fig. 11.17. Since when v
increases, a factor of the kind exp(−αv²) diminishes more rapidly
than the factor v² grows, the function, which begins at zero (owing
to v²), reaches a peak and then asymptotically tends to zero. The
area enveloped by the curve equals unity [compare with Eq. (11.7)].

Let us find the mean velocity of the molecules ⟨v⟩ (we have in
mind the arithmetical mean velocity). By analogy with Eq. (11.9),
we have

    \langle v \rangle = \int_0^{\infty} v F(v) \, dv = \left(\frac{m}{2\pi kT}\right)^{3/2} 4\pi \int_0^{\infty} \exp\left(-\frac{m v^2}{2kT}\right) v^3 \, dv

By passing over to the variable ξ = v² and integrating by parts, we
arrive at the following result:

    \langle v \rangle = \sqrt{\frac{8kT}{\pi m}}    (11.65)

According to Eq. (11.11)

    \langle v^2 \rangle = \int_0^{\infty} v^2 F(v) \, dv = \left(\frac{m}{2\pi kT}\right)^{3/2} 4\pi \int_0^{\infty} \exp\left(-\frac{m v^2}{2kT}\right) v^4 \, dv    (11.66)

From Eq. (A.6), we get

    \int_0^{\infty} \exp\left(-\frac{m v^2}{2kT}\right) v^4 \, dv = \frac{3}{8} \sqrt{\frac{\pi}{(m/2kT)^5}} = \frac{3}{8\pi^2} \left(\frac{2\pi kT}{m}\right)^{5/2}

Introducing this value of the integral into Eq. (11.66), we get for
⟨v²⟩ the value 3kT/m that we already know [see Eq. (11.31)]. This
is not surprising because when finding the value of α in Eq. (11.57)
we proceeded from Eq. (11.32), i.e. in essence from Eq. (11.31).
The square root of ⟨v²⟩ is called the mean square velocity:

    v_{m.sq} = \sqrt{\langle v^2 \rangle} = \sqrt{\frac{3kT}{m}}    (11.67)

The velocity corresponding to the maximum of F(v) will be the
most probable velocity. Taking the derivative of Eq. (11.64) with
respect to v, omitting the constant factors and equating the
expression obtained to zero, we arrive at the equation

    \exp\left(-\frac{m v^2}{2kT}\right) \left[2 - \frac{m v^2}{kT}\right] v = 0

The values v = 0 and v = ∞ satisfying this equation correspond
to minima of F(v). The value of v making the expression in brackets
vanish is the required most probable velocity v_prob:

    v_{prob} = \sqrt{\frac{2kT}{m}}    (11.68)

By comparing Eqs. (11.68), (11.65), and (11.67), we find that

    v_{prob} : \langle v \rangle : v_{m.sq} = \sqrt{2} : \sqrt{8/\pi} : \sqrt{3} = 1 : 1.13 : 1.22

Figure 11.18 illustrates this proportion. 
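
The proportion can also be checked numerically. The sketch below is an illustration only, under stated assumptions: the function names are ours, the integration routine is a plain trapezoidal rule, and the mass of an oxygen molecule at 300 K is used merely as a convenient example. It evaluates the three characteristic velocities and confirms that the area under F(v) of Eq. (11.64) is unity.

```python
import math

K_B = 1.380649e-23   # Boltzmann constant, J/K
N_A = 6.02214076e23  # Avogadro constant, 1/mol


def maxwell_F(v, m, T):
    """Speed distribution F(v) of Eq. (11.64)."""
    a = m / (2.0 * K_B * T)
    return (a / math.pi) ** 1.5 * math.exp(-a * v * v) * 4.0 * math.pi * v * v


def v_prob(m, T):
    """Most probable velocity, Eq. (11.68)."""
    return math.sqrt(2.0 * K_B * T / m)


def v_mean(m, T):
    """Arithmetical mean velocity, Eq. (11.65)."""
    return math.sqrt(8.0 * K_B * T / (math.pi * m))


def v_mean_square(m, T):
    """Mean square velocity, Eq. (11.67)."""
    return math.sqrt(3.0 * K_B * T / m)


def integrate(func, lo, hi, n=20000):
    """Plain trapezoidal rule on n equal steps."""
    h = (hi - lo) / n
    total = 0.5 * (func(lo) + func(hi))
    for k in range(1, n):
        total += func(lo + k * h)
    return total * h


# Illustrative values (assumptions): an oxygen molecule at 300 K.
m_o2 = 32e-3 / N_A
T = 300.0
vp = v_prob(m_o2, T)

# The upper limit 10 * v_prob replaces infinity; the tail beyond it is negligible.
area = integrate(lambda v: maxwell_F(v, m_o2, T), 0.0, 10.0 * vp)
mean_numeric = integrate(lambda v: v * maxwell_F(v, m_o2, T), 0.0, 10.0 * vp)

if __name__ == "__main__":
    print("area under F(v):", area)
    print("ratios:", 1.0, v_mean(m_o2, T) / vp, v_mean_square(m_o2, T) / vp)
```

The ratios do not depend on the mass or the temperature chosen, since all three velocities are proportional to the square root of kT/m.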

Using Eq. (11.68) in (11.64), we shall find the maximum value of
the function F(v):

    F(v_{prob}) = \frac{4}{e} \sqrt{\frac{m}{2\pi kT}}    (11.69)

It can be seen from Eqs. (11.68) and (11.69) that when the
temperature grows (or the mass of a molecule diminishes), the peak of the
curve moves to the right and becomes lower. The area confined by
the curve, as we know, remains unchanged. Figure 11.19 compares
two distribution curves that can be interpreted either as relating to
different temperatures T_1 and T_2 (with identical m), or as relating
to different masses of the molecules m_1 and m_2 (with identical T).

The relative number of molecules whose velocity exceeds a certain
value v_0 is determined by the expression

    \int_{v_0}^{\infty} F(v) \, dv

In Fig. 11.19, the part of the area confined by the curve that is to
the right of v_0 corresponds to this integral. It can be seen from the
figure that the relative number of molecules having velocities
exceeding v_0 greatly increases with elevation of the temperature.

Table 11.3 gives the relative number of molecules ΔN/N for
different velocity intervals calculated with the aid of function (11.64).

Table 11.3

v/v_prob     ΔN/N, %          v/v_prob     ΔN/N, %

0-0.5        8.1              2-3          4.6
0.5-1.5      70.7             >3           0.04
1.5-2        16.6             >5           8 × 10⁻⁹

Inspection of the table shows that the velocity of 70 per cent of all
the molecules differs from the most probable value by not over
50 per cent. Only 0.04 per cent of the molecules have a velocity
exceeding v_prob more than three times. And only one of 12 000 million
molecules, on the average, has a velocity exceeding 5v_prob.

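
The entries of Table 11.3 can be reproduced in closed form. With u = v/v_prob, function (11.64) becomes (4/√π) u² exp(−u²), and its integral from u to infinity equals erfc(u) + (2/√π) u exp(−u²). The sketch below merely evaluates this expression; the function names are our own, and the complementary error function is used so that the very small tail beyond 5v_prob is not lost to rounding.

```python
import math

TWO_OVER_SQRT_PI = 2.0 / math.sqrt(math.pi)


def fraction_above(u):
    """Relative number of molecules with v > u * v_prob."""
    return math.erfc(u) + TWO_OVER_SQRT_PI * u * math.exp(-u * u)


def fraction_between(u_lo, u_hi):
    """Relative number of molecules with u_lo * v_prob < v < u_hi * v_prob."""
    return fraction_above(u_lo) - fraction_above(u_hi)


if __name__ == "__main__":
    for lo, hi in [(0.0, 0.5), (0.5, 1.5), (1.5, 2.0), (2.0, 3.0)]:
        print(f"{lo}-{hi}: {100.0 * fraction_between(lo, hi):.1f} %")
    print(f">3: {100.0 * fraction_above(3.0):.2f} %")
    print(f">5: {100.0 * fraction_above(5.0):.1e} %")
```
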
Let us assess the mean velocity of oxygen molecules. It is convenient
to perform the calculations by replacing k/m in Eq. (11.65)
with the ratio R/M equal to it. The expression for the mean velocity
thus becomes

    \langle v \rangle = \sqrt{\frac{8RT}{\pi M}}    (11.70)

The molecular mass of oxygen is 32. Consequently, the mass of a
mole M = 32 × 10⁻³ kg/mol. Room temperature is about 300 K.
Introducing numerical values into Eq. (11.70), we get

    \langle v \rangle = \sqrt{\frac{8 \times 8.31 \times 300}{3.14 \times 32 \times 10^{-3}}} \approx 500 m/s


Thus, each molecule of oxygen travels a path, on the average, of 
0.5 km in one second. Since a molecule collides very frequently with 
other molecules, this path consists of a great number of short straight 
lengths forming a broken line. 

Hydrogen molecules have a mass that is 1/16 that of an oxygen 
molecule. As a result, their velocity at the same temperature will be 
four times greater and will be almost 2 km/s at room temperature. 
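
These estimates follow directly from Eq. (11.70). A minimal sketch, with a function name of our own choosing and R = 8.31 J/(mol·K) as in the calculation above; the molar mass of hydrogen, 2 × 10⁻³ kg/mol, is taken as 1/16 that of oxygen:

```python
import math

R = 8.31  # molar gas constant, J/(mol*K), rounded as in the text


def mean_speed(molar_mass, temperature):
    """Arithmetical mean velocity from Eq. (11.70), in m/s."""
    return math.sqrt(8.0 * R * temperature / (math.pi * molar_mass))


v_oxygen = mean_speed(32e-3, 300.0)
v_hydrogen = mean_speed(2e-3, 300.0)

if __name__ == "__main__":
    print(f"oxygen:   {v_oxygen:.0f} m/s")
    print(f"hydrogen: {v_hydrogen:.0f} m/s")
```

The unrounded result for oxygen is somewhat below the round figure of 500 m/s, and the hydrogen value is exactly four times greater, since the velocity varies as the inverse square root of the mass.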

If we have a mixture of gases in equilibrium, then the distribution 
(11.64) occurs within the limits of the molecules of each species with 
its own value of m. The heavier molecules will travel on the average 
with a lower velocity than the lighter ones. 

On the basis of the distribution of molecules by velocities

    dN_v = N \left(\frac{m}{2\pi kT}\right)^{3/2} \exp\left(-\frac{m v^2}{2kT}\right) 4\pi v^2 \, dv    (11.71)

we can find the distribution of the molecules by their values of the
kinetic energy of translation (we shall denote it by the symbol ε).
For this purpose, we must pass over from a variable v to a variable ε
equal to mv²/2. Substituting v = (2ε/m)^{1/2} and dv = (2mε)^{−1/2} dε
in Eq. (11.71), we obtain

    dN_\varepsilon = N \frac{2}{\sqrt{\pi}} (kT)^{-3/2} \exp\left(-\frac{\varepsilon}{kT}\right) \sqrt{\varepsilon} \, d\varepsilon    (11.72)

where dN_ε stands for the number of molecules whose kinetic energy
of translation has values ranging from ε to ε + dε.
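
As a check on Eq. (11.72), we can verify numerically that dN_ε/N integrates to unity and that the mean kinetic energy of translation equals (3/2)kT. The sketch below is an illustration under our own assumptions: the names are hypothetical, the upper limit of integration is truncated, and the substitution ε = kT x² is used only to make the simple trapezoidal rule accurate near ε = 0.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K


def energy_density(eps, kT):
    """Fraction of molecules per unit energy implied by Eq. (11.72)."""
    return 2.0 / math.sqrt(math.pi) * kT ** -1.5 * math.exp(-eps / kT) * math.sqrt(eps)


def integrate_over_energy(g, kT, x_max=8.0, n=8000):
    """Integrate g(eps) * energy_density(eps) over eps from 0 upward.

    The substitution eps = kT * x**2 (so d(eps) = 2 * kT * x * dx) removes
    the square-root behaviour at eps = 0. The integrand vanishes at x = 0,
    so the trapezoidal sum starts at k = 1.
    """
    h = x_max / n
    total = 0.0
    for k in range(1, n + 1):
        x = k * h
        eps = kT * x * x
        weight = 0.5 if k == n else 1.0
        total += weight * g(eps) * energy_density(eps, kT) * 2.0 * kT * x
    return total * h


kT = K_B * 300.0  # illustrative temperature of 300 K (assumption)
norm = integrate_over_energy(lambda eps: 1.0, kT)
mean_energy = integrate_over_energy(lambda eps: eps, kT)

if __name__ == "__main__":
    print("normalization:", norm)
    print("mean energy in units of kT:", mean_energy / kT)
```
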

The distribution of the molecules by values of ε is thus
characterized by the function

    f(\varepsilon) = A \exp\left(-\frac{\varepsilon}{kT}\right) \sqrt{\varepsilon}    (11.73)

where A is a normalization factor equal to (2/\sqrt{\pi}) (kT)^{-3/2}.


11.7. Experimental Verification of the Maxwell Distribution Law

The first experimental determination of the velocities of molecules
was conducted by O. Stern in 1920. The apparatus he used for
this purpose consisted of two coaxial cylinders (Fig. 11.20). A silver- 
coated platinum wire was made taut along the axis of the apparatus. 
When the wire was heated by passing an electric current through it, 

silver atoms evaporated from its surface. The 
velocities of the evaporated atoms correspond- 
ed to the temperature of the wire. The atoms 
travelled in radial directions after escaping 
from the wire. The inner cylinder had a nar- 
row longitudinal slot through which a narrow 
beam of atoms (a molecular beam) passed 
outward. The entire apparatus was evacuated 
to prevent deviations of the silver atoms due 
to collisions with air molecules. After reaching 
the surface of the outer cylinder, the silver 
atoms settled on it and formed a layer having 
the shape of a narrow vertical stripe. 

If the entire apparatus is brought into rotation, the trace left by
the molecular beam will be displaced along the surface of the outer
cylinder by the amount Δs (see Fig. 11.20). This will occur because
the apparatus manages to turn through the angle Δφ while the
silver atoms are flying through the space between the cylinders.
As a result, a different part of the outer cylinder will be opposite
the beam and it will be displaced relative to the initial trace s_0
by the amount Δs equal to R Δφ (R is the radius of the outer cylinder).
Considering the motion of the silver atoms in a rotating reference
frame associated with the cylinders, the displacement of the trace
can be explained by the action on the atoms of a Coriolis force equal
to 2m[vω].

The distance Δs between the original and the displaced stripes
of silver can be related to the angular velocity of the cylinders ω,
the geometry of the apparatus, and the velocity of the atoms v.
Denoting the flying time by Δt, we can write that

    \Delta s = \omega R \, \Delta t    (11.74)

Since the radius of the inner cylinder is small in comparison with
that of the outer cylinder R, the flying time can be assumed to
equal

    \Delta t = \frac{R}{v}

Using this expression in Eq. (11.74) and solving the resulting
equation with respect to v, we get

    v = \frac{\omega R^2}{\Delta s}

The velocity of the atoms can be determined by measuring the
displacement of the trace Δs and the angular velocity of the apparatus.
Complications are introduced, however, by the fact that owing to
velocity distribution the atoms have different velocities. As a
result, the displaced stripe will be blurred*. By studying the
profile of the trace (see Fig. 11.20), Stern found it possible to form
an approximate notion of how the silver atoms are distributed
by velocities.

The results of Stern’s experi- 
ment confirmed the correctness of estimating the mean velocity of 
atoms that follows from the Maxwell distribution. This experiment, 
however, could give only very approximate information on the 
nature of the distribution itself. 
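
The relation v = ωR²/Δs obtained above is easily put into numbers. In the sketch below the function names and all numerical values (radius of the outer cylinder, rotation frequency, atomic velocity) are hypothetical and serve only to show the order of magnitude of the displacement; they are not the parameters of Stern's apparatus.

```python
import math


def stern_displacement(v, omega, radius):
    """Displacement of the trace for atoms of velocity v: delta_s = omega * R**2 / v."""
    return omega * radius ** 2 / v


def stern_velocity(delta_s, omega, radius):
    """Velocity of the atoms recovered from the measured displacement delta_s."""
    return omega * radius ** 2 / delta_s


# Hypothetical values: R = 0.1 m, 50 revolutions per second, v = 500 m/s.
omega = 2.0 * math.pi * 50.0
radius = 0.1
delta_s = stern_displacement(500.0, omega, radius)

if __name__ == "__main__":
    print(f"displacement: {1000.0 * delta_s:.2f} mm")
    print(f"recovered velocity: {stern_velocity(delta_s, omega, radius):.1f} m/s")
```

With these assumed values the displacement is a few millimetres, and slower atoms are displaced farther than faster ones, which is why the displaced stripe is blurred.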

The distribution law was verified more accurately in the experiment
conducted by J. Lammert (1929). He passed a molecular beam
through two rotating disks with radial slots displaced relative to
each other through an angle φ (Fig. 11.21). Of the molecules that
pass through the slot in the first disk, only those which find the
slot of the second disk in front of them when they arrive at it will
fly through the second disk. The faster
molecules will reach the second disk too early, and the slower ones
too late to pass through the slot. Thus, this apparatus makes it
possible to separate molecules having a definite velocity from a beam
(owing to the finite width of the slots, the apparatus separates
molecules whose velocities are within a certain interval Δv). The mean
velocity of the molecules separated by the apparatus can be found
from the condition that the time t_1 during which the molecules
cover the distance l between the disks (t_1 = l/v) must coincide with
the time t_2 during which the disks rotate through the angle φ (i.e.,
t_2 = φ/ω). Equating these two times, we get

    v = \frac{\omega l}{\varphi}

* The width of the stripe obtained with a stationary apparatus is deter- 
mined only by the geometry of the apparatus, in particular by the 
width of the slot through which the molecular beam emerges. 





By changing the angular velocity ω of the apparatus (or the angle φ
between the disks), we can separate molecules having different
magnitudes of their velocity from the beam. By trapping these
molecules during a definite time, we can find their relative number
in the beam.

The results of Lammert’s experiment and of other experiments 
undertaken for the same purpose completely agree with the distribu- 
tion law established theoretically by Maxwell. 
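
The selection condition v = ωl/φ can be illustrated as follows. The function names and the numerical values of the disk spacing and slot angle are assumptions made only for this example.

```python
def selected_speed(omega, distance, phi):
    """Velocity passed by the two disks: v = omega * l / phi."""
    return omega * distance / phi


def omega_for_speed(v, distance, phi):
    """Angular velocity needed to select molecules of velocity v."""
    return v * phi / distance


# Hypothetical geometry: disks 0.5 m apart, slots offset by 0.1 rad.
distance = 0.5
phi = 0.1
v_selected = selected_speed(100.0, distance, phi)

if __name__ == "__main__":
    print(f"selected velocity at 100 rad/s: {v_selected:.0f} m/s")
    for v in (250.0, 500.0, 1000.0):
        print(f"v = {v:6.0f} m/s needs omega = {omega_for_speed(v, distance, phi):.0f} rad/s")
```

Scanning ω over a range of values and recording the number of trapped molecules at each setting gives the velocity distribution point by point.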

It must be noted that the distribution of molecules by velocities 
in a beam flying out through a hole in a vessel differs somewhat from 
the distribution in a closed vessel. Since the faster molecules will 
pass through the hole in a relatively greater number than the slower 
ones, the beam will be rich in the faster molecules. Because the 
number of molecules flying through the hole in unit time is
proportional to v, the distribution in the beam will be characterized
not by the function (11.64), but by the function

    F_1(v) = A_1 \exp\left(-\frac{m v^2}{2kT}\right) v^3

where A_1 is a normalization factor. The most probable velocity in
this case is v_prob = \sqrt{3kT/m}, and the mean velocity is
⟨v⟩ = \sqrt{9\pi kT/8m}.


11.8. The Boltzmann Distribution 

The barometric formula (10.71) obtained in Sec. 10.14, i.e.

    p = p_0 \exp\left(-\frac{Mgh}{RT}\right)

gives the dependence of the pressure on the altitude for an imaginary
isothermal atmosphere. Let us replace M/R in the exponent with
the ratio m/k equal to it (m is the mass of a molecule, and k is the
Boltzmann constant). In addition, let us substitute nkT for p and
n_0 kT for p_0 according to Eq. (10.21). After cancelling kT in both
sides of the equation, we arrive at the formula

    n = n_0 \exp\left(-\frac{mgh}{kT}\right)    (11.75)

Here n is the density of the molecules (i.e. their number in a unit
volume) at the altitude h, and n_0 is the density of the molecules at
the altitude h_0 = 0.

It can be seen from Eq. (11.75) that with lowering of the
temperature, the number of particles at altitudes other than zero
diminishes and vanishes at T = 0 (Fig. 11.22). At absolute zero, all the
molecules would be at the Earth’s surface. At high temperatures, on
the contrary, n only slightly diminishes with increasing altitude so
that the molecules are distributed by altitude almost uniformly.

This fact has a simple physical explanation. Each concrete
distribution of the molecules by altitude sets in as a result of the action
of two trends: (1) the attraction of the molecules to the Earth
(characterized by the force mg) tends to arrange them on the Earth’s
surface, and (2) thermal motion (characterized by the quantity kT)
tends to scatter the molecules uniformly over all the altitudes. The
first trend prevails to a greater extent, the greater is m and the
smaller is T, and the molecules crowd together at the Earth’s
surface. In the limit at T = 0, the thermal motion stops completely,
and under the influence of the Earth’s attraction the molecules
occupy its surface. At high temperatures, thermal motion prevails,
and the density of the molecules slowly diminishes with the altitude.

Fig. 11.22 (n plotted against h)

At different altitudes, a molecule has different stores of potential
energy:

    \varepsilon_p = mgh    (11.76)

Consequently, the distribution of the molecules by altitude is also
their distribution by the values of the potential energy. In view of
Eq. (11.76), we can write Eq. (11.75) as follows:

    n = n_0 \exp\left(-\frac{\varepsilon_p}{kT}\right)    (11.77)

where n is the density of the molecules at the spot in space where
a molecule has the potential energy ε_p, and n_0 is the density of the
molecules where the potential energy of a molecule vanishes.

Examination of Eq. (11.77) shows that the density of molecules 
per unit volume is greater where their potential energy is lower, and, 
conversely, their density is lower where their potential energy is 
greater. 
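
The competition between gravity and thermal motion described above is easy to see numerically. The sketch below uses the molar form M/R of the ratio m/k; the function names are ours, and the value g = 9.81 m/s², the molar mass of oxygen, and the altitudes and temperatures in the loop are assumptions chosen only for illustration.

```python
import math

R = 8.31  # molar gas constant, J/(mol*K), rounded as in the text
G = 9.81  # free-fall acceleration, m/s^2 (assumed standard value)


def density_ratio(h, molar_mass, temperature):
    """n/n_0 at the altitude h from Eq. (11.75), using m/k = M/R."""
    return math.exp(-molar_mass * G * h / (R * temperature))


def relative_density(eps_p, kT):
    """n/n_0 from Eq. (11.77) for the potential energy eps_p."""
    return math.exp(-eps_p / kT)


if __name__ == "__main__":
    for temperature in (100.0, 300.0, 1000.0):
        ratios = [density_ratio(h, 32e-3, temperature) for h in (0.0, 5e3, 10e3)]
        print(temperature, ["%.3f" % r for r in ratios])
```

At the lower temperature the density falls off steeply with altitude, whereas at the higher temperature the molecules are spread much more evenly.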

According to Eq. (11.77), the ratio of n_1 to n_2 at points where the
potential energy of a molecule has the values ε_{p,1} and ε_{p,2} is

    \frac{n_1}{n_2} = \exp\left(-\frac{\varepsilon_{p,1} - \varepsilon_{p,2}}{kT}\right)    (11.78)

L. Boltzmann proved that distribution (11.77) holds not only
for the potential field of the Earth’s gravitation, but also for any
potential field of forces containing an assembly of any identical
particles in a state of chaotic thermal motion. Accordingly,
distribution (11.77) is called the Boltzmann distribution.

Whereas Maxwell’s law gives the distribution of particles by 
values of their kinetic energy, Boltzmann’s law gives their distri- 
bution by values of their potential energy. Both distributions are 
characterized by the presence of an exponential factor whose expo- 
nent is the ratio of the kinetic or, correspondingly, the potential 
energy of one molecule to the quantity determining the mean energy 
of thermal motion of a molecule. 

According to Eq. (11.77), the number of molecules contained in
the volume dV = dx dy dz at a point with the coordinates x, y, z is

    dN_{x, y, z} = n_0 \exp\left(-\frac{\varepsilon_p(x, y, z)}{kT}\right) dx \, dy \, dz    (11.79)

We have obtained another expression of the Boltzmann distribution
law.

The Maxwell and Boltzmann distributions can be combined into
a single Maxwell-Boltzmann law according to which the number
of molecules whose velocity components are within the limits from
v_x, v_y, v_z to v_x + dv_x, v_y + dv_y, v_z + dv_z and whose coordinates
are within the limits from x, y, z to x + dx, y + dy, z + dz is

    dN_{v_x, v_y, v_z, x, y, z} = A \exp\left(-\frac{\varepsilon_p + m v^2/2}{kT}\right) dv_x \, dv_y \, dv_z \, dx \, dy \, dz    (11.80)

[see Eqs. (11.40), (11.63), and (11.79)]. Here A is a normalization
factor equal to n_0 (m/2πkT)^{3/2}. We remind our reader that
ε_p = ε_p(x, y, z) and v² = v_x² + v_y² + v_z².

The potential energy $\varepsilon_p$ and the kinetic energy $mv^2/2$, and therefore
the total energy E, can take on a continuous series of values in
distribution (11.80). If the total energy of a particle can take on only
a discrete series of values $E_1$, $E_2$, ..., as is the case, for example, for
the internal energy of an atom, then the Boltzmann distribution has
the form

$$N_i = A \exp\left(-\frac{E_i}{kT}\right) \qquad (11.81)$$

where $N_i$ = number of particles in a state with the energy $E_i$
      A = constant of proportionality that must comply with
          the condition $\sum N_i = A \sum \exp(-E_i/kT) = N$ (here
          N is the total number of particles in the system being
          considered).

Introducing the value of A found from the above condition into
Eq. (11.81), we get the final expression of the Boltzmann distribution
for the case of discrete values of the energy:

$$N_i = \frac{N \exp(-E_i/kT)}{\sum_j \exp(-E_j/kT)} \qquad (11.82)$$
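
The discrete form of the Boltzmann distribution, Eq. (11.82), is easy to evaluate numerically. The following is only a minimal sketch: the function name `boltzmann_populations` and the three-level system with a spacing of kT are assumptions chosen for illustration and do not come from the text.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K


def boltzmann_populations(energies, temperature, total):
    """Return N_i = N exp(-E_i/kT) / sum_j exp(-E_j/kT) for every level."""
    # Subtracting the lowest energy leaves the ratio in Eq. (11.82)
    # unchanged and keeps the exponentials from overflowing.
    e_min = min(energies)
    weights = [math.exp(-(e - e_min) / (K_B * temperature)) for e in energies]
    norm = sum(weights)
    return [total * w / norm for w in weights]


# Hypothetical system: three levels spaced by kT at T = 300 K.
T = 300.0
levels = [0.0, K_B * T, 2 * K_B * T]
populations = boltzmann_populations(levels, T, total=1000.0)
print(populations)
```

With this choice of levels, each population is smaller than the preceding one by the factor e, and the populations add up to the total number of particles N.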

11.9. Determination of the Avogadro Constant
by Perrin

J. Perrin used distribution (11.75) as the basis of experiments 
(1909) for determining the Avogadro constant. Very minute solid 
particles suspended in a liquid are in a state of continuous disordered 
motion called Brownian motion (see Sec. 10.1). Its cause is that with 
sufficiently small particles, the momenta imparted to a particle by 
the molecules colliding with it at different sides 
are not balanced. If a particle has appreciable dimensions, a great
number of molecules collide with it simultaneously, so that the
resultant momentum produced by all these collisions is nil. When a
particle is small, the deviations of the velocities of separate
molecules and of the number of colliding molecules from the mean
values begin to tell. If the velocity or the number of molecules
colliding with a particle on one side differs from that of those
colliding with it on the other side, then the resultant momentum
imparted to the particle will differ from zero, and the particle will
begin to travel in the relevant direction. At the next instant, the
resultant momentum has a different direction. Consequently, the
particle will move chaotically all the time.

Fig. 11.23

Brownian motion points to the fact that sufficiently small par- 
ticles are involved in the thermal motion performed by molecules. 
Since they take part in thermal motion, such particles should behave 
like giant molecules, and they should obey the laws of the kinetic 
theory, in particular the Boltzmann distribution [see Eq. (11.75)].

The main difficulty in Perrin’s experiment was the preparation 
of identical particles and determination of their mass. Using multi- 
ple centrifuging, Perrin succeeded in preparing a very homogeneous 
emulsion of virtually identical globules of gamboge* with radii of 
the order of several tenths of a micrometre. The emulsion was placed 
in a flat glass tray 0.1 mm deep and was observed with the aid of 
a microscope (Fig. 11.23). The microscope had such a small depth 
of field that only particles in a horizontal layer about one microme- 
tre thick were visible in it. By moving the microscope vertically, it 
was possible to study the distribution of the Brownian particles in 
height (depth). 

Let h stand for the height of the layer visible in the microscope 
above the bottom of the tray. The number of particles getting into the 
field of vision of the microscope is determined by the formula 

$$\Delta N = n(h)\, S\, \Delta h$$

where n(h) is the number of Brownian particles in a unit volume at
the height h, S is the area, and $\Delta h$ is the depth of field of the microscope.
Applying Eq. (11.75) to the Brownian particles, we can write

$$n(h) = n_0 \exp\left(-\frac{p'h}{kT}\right)$$

where $n_0$ is the number of particles in a unit volume at h = 0, and p'
is the weight of a Brownian particle in the emulsion, i.e. the weight
taken with account of the correction for Archimedes’ principle.

* Gamboge (cambogia) is a thick gum resin obtained from notches in the
bark of some species of trees growing in Indochina and Sri Lanka.

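Expressing the number of particles $\Delta N$ for two different heights
$h_1$ and $h_2$, we get

$$\Delta N_1 = n_0 \exp\left(-\frac{p'h_1}{kT}\right) S\, \Delta h$$

$$\Delta N_2 = n_0 \exp\left(-\frac{p'h_2}{kT}\right) S\, \Delta h$$

Finally, taking logarithms of the ratio $\Delta N_1/\Delta N_2$, we arrive at the
following expression:

$$\ln \frac{\Delta N_1}{\Delta N_2} = \frac{p'(h_2 - h_1)}{kT} \qquad (11.83)$$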


After measuring p', T, $(h_2 - h_1)$, $\Delta N_1$, and $\Delta N_2$, Eq. (11.83) can
be used to find the Boltzmann constant k. Next, by dividing the
molar gas constant R by k, the Avogadro constant $N_A$ can be found.
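
The chain of calculations from Eq. (11.83) to the Avogadro constant can be sketched as follows. All the input numbers here (particle radius, density difference, heights, and particle counts) are hypothetical values invented for illustration. They are not Perrin's measurements, and the function name is likewise an assumption.

```python
import math

R = 8.314  # molar gas constant, J/(mol K)
G = 9.81   # acceleration of free fall, m/s^2


def boltzmann_constant_from_counts(dn1, dn2, h1, h2, p_eff, temperature):
    """Solve ln(dN1/dN2) = p'(h2 - h1)/(kT), Eq. (11.83), for k."""
    return p_eff * (h2 - h1) / (temperature * math.log(dn1 / dn2))


# Hypothetical globule: radius 0.2 micrometre, density exceeding that of
# the liquid by 200 kg/m^3 (this is the Archimedes correction).
radius = 0.2e-6
density_difference = 200.0
volume = 4.0 / 3.0 * math.pi * radius**3
p_eff = volume * density_difference * G  # effective weight p', N

# Hypothetical counts in two layers 30 micrometres apart at 293 K.
k = boltzmann_constant_from_counts(
    dn1=100, dn2=61, h1=0.0, h2=30e-6, p_eff=p_eff, temperature=293.0
)
avogadro = R / k
print(k, avogadro)
```

With these invented inputs the result falls within roughly ten per cent of the accepted values of k and $N_A$. This shows only that the formula is applied consistently and says nothing about the accuracy of the experiment itself.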

The value of $N_A$ obtained by Perrin using different emulsions
ranged from $6.5 \times 10^{23}$ to $7.2 \times 10^{23}$ mol$^{-1}$. Its value determined
by other more accurate methods is $6.02 \times 10^{23}$ mol$^{-1}$. Thus the value
obtained by Perrin agrees quite well with values obtained by other 
methods. This proves the possibility of applying the Boltzmann dis- 
tribution to Brownian particles. 


11.10. Macro- and Microstates.
Statistical Weight

The state of a macroscopic body (i.e. a body formed by an enor- 
mous number of molecules) can be set with the aid of the volume, 
pressure, temperature, internal energy, and other macroscopic 
(i.e. characterizing the body as a whole) quantities. A state character- 
ized in this way is defined as a macrostate. 

A state of a macroscopic body which is characterized in such 
detail that the states of all the molecules forming the body are set is 
defined as a microstate. 

A macrostate can be achieved in various ways, and a certain mi- 
crostate of the body corresponds to each way. The number of various 
microstates corresponding to a given macrostate is called the statistical 
weight or thermodynamic probability of the macrostate. Thus, the
statistical weight is the number of microscopic ways in which we can 
achieve a given macrostate. 

To explain the concept of statistical weight, let us consider the 
ways in which the molecules of a gas can be distributed between the 
two halves of the vessel containing the gas. Let the total number of 
molecules be N . We shall characterize the state of the gas by the 
number of molecules n in the left half of the 
vessel (the number of molecules in the right 
half will accordingly be N — n). We shall 
characterize the state of an individual mole- 
cule by indicating which half of the vessel it 
is in. Such a description of the state of a gas 
and the states of its individual molecules is 
naturally far from complete. But it is suffi- 
cient to explain the features of the sta- 
tistical behaviour of any macrosystems 
using this example. 

Let us begin with the total number of the molecules equal to four 
(Fig. 11.24). Each molecule can be either in the left or the right half 
of the vessel with an equal probability. Therefore, the probability 
of, say, molecule 1 being in the left half of the vessel is 1/2. The 
residing of molecule 1 in the left half of the vessel and the residing 
of molecule 2 in the same half are statistically independent events. 
Hence, the probability of molecules 1 and 2 simultaneously occupy- 
ing the left half of the vessel equals the product of their individual 
probabilities of being there, i.e. $(1/2)^2$. Continuing this reasoning, we
find that the probability of all four molecules simultaneously residing
in the left half of the vessel is $(1/2)^4$.

Similar reasoning shows that the probability of any arrangement of
the molecules in the vessel (say, one in which molecules 1 and 4 are
in the left half and molecules 2 and 3 in the right one) also equals $(1/2)^4$.
Each of the arrangements is a microstate of the gas. It follows from
what has been said above that the probability of all the microstates
is the same and equals $(1/2)^4$.

Table 11.4 shows all the possible ways of distributing the molecules 
between the halves of the vessel (all the microstates of the gas). 
A state characterized by, for instance, the left half of the vessel
containing one molecule (it makes no difference which one) and the right half
containing three is a macrostate. Inspection of the table shows that
four microstates correspond to such a macrostate. Hence, the statistical
weight of the given macrostate is 4, and the probability (conventional,
and not thermodynamic) is 4/16. A macrostate in which
both halves of the vessel contain the same number of molecules is
realized with the aid of six microstates. Its statistical weight is
accordingly 6, and its probability (conventional) is 6/16.

Fig. 11.24

Table 11.4

State                       Ways of realizing state                  Number of ways
No. of molecules            Nos. of molecules   Nos. of molecules    of realizing a
at left     at right        at left             at right             given state (Ω)
-----------------------------------------------------------------------------------
0           4               none                1, 2, 3, 4           1
1           3               1                   2, 3, 4              4
                            2                   1, 3, 4
                            3                   1, 2, 4
                            4                   1, 2, 3
2           2               1, 2                3, 4                 6
                            1, 3                2, 4
                            1, 4                2, 3
                            2, 3                1, 4
                            2, 4                1, 3
                            3, 4                1, 2
3           1               1, 2, 3             4                    4
                            1, 2, 4             3
                            1, 3, 4             2
                            2, 3, 4             1
4           0               1, 2, 3, 4          none                 1
-----------------------------------------------------------------------------------
Total ways                                                           2^4 = 16


The above example shows that all the microstates of a given system 
are equally probable. Consequently, the statistical weight is propor- 
tional to the probability (conventional) of the macrostate. The state- 
ment that all microstates are equally probable forms the foundation 
of statistical physics and is called the ergodic hypothesis. 

According to Table 11.4, when we are dealing with four molecules, 
the probability of all the molecules gathering in one of the halves 
(left or right) of the vessel is quite great (one-eighth). Matters change 
appreciably, however, with an increase in the number of molecules. 

Let us find the number of ways (the number of microstates) in which 
a macrostate can occur characterized by the left half of the vessel 
containing n molecules of their total number N , and by the right 
half containing N — n molecules. We shall number the molecules 
from 1 to N for this purpose. Next we shall take one molecule at a
time and put it in the left half of the vessel. We can choose the first
molecule in N ways, the second in N - 1 ways, the third in N - 2
ways, and, finally, we can choose the n-th molecule in N - n + 1
ways. We shall place the remaining N - n molecules in the right
half of the vessel.

We can thus see that the number z of ways in which we can random- 
ly choose n molecules for the left half of the vessel from their total 
number N is 


$$z = N (N - 1) (N - 2) \ldots (N - n + 1)$$

Multiplying and dividing this number by $(N - n)!$, we get

$$z = \frac{N!}{(N - n)!} \qquad (11.84)$$


Not all z ways, however, result in microstates that differ from 
one another. Separate microstates differ only in the combination of 
the numbers of the molecules chosen for each half of the vessel, but 
not in the sequence in which these numbers were selected. For exam- 
ple, for N = 3 and n = 2, we get the selections 


1-2 2-1 3-1 

1-3 2-3 3-2 


Of these, selections 1-2 and 2-1 correspond to the same microstate 
(molecules 1 and 2 in the left half and 3 in the right one). The same 
relates to selections 1-3 and 3-1, and also to 2-3 and 3-2. Thus, 
selections differing only in the permutation of n numbers of the 
molecules chosen for the left half of the vessel (the number of these 
selections is n!) correspond to the same microstate. Hence, to obtain 
the number of microstates by means of which we can provide the 
macrostate $(n, N - n)$, we must divide the number z given by Eq.
(11.84) by $n!$. The resulting expression for the statistical weight is

$$\Omega(n, N - n) = \frac{N!}{n!\,(N - n)!} \qquad (11.85)$$

It is easy to see that $\Omega(2, 4 - 2) = 6$, and $\Omega(1, 4 - 1) = 4$ (see
Table 11.4).
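
The agreement between Eq. (11.85) and a direct count of microstates can be checked by brute force. This is an illustrative sketch only: the function name and the labels "L" and "R" for the two halves of the vessel are assumptions introduced here.

```python
from itertools import product
from math import comb


def statistical_weights_by_enumeration(n_molecules):
    """Count the microstates of every macrostate (n molecules at left)."""
    weights = [0] * (n_molecules + 1)
    # A microstate assigns each numbered molecule to the left ("L")
    # or the right ("R") half of the vessel.
    for arrangement in product("LR", repeat=n_molecules):
        weights[arrangement.count("L")] += 1
    return weights


enumerated = statistical_weights_by_enumeration(4)
from_formula = [comb(4, n) for n in range(5)]  # N!/(n!(N - n)!)
print(enumerated)    # [1, 4, 6, 4, 1]
print(from_formula)  # [1, 4, 6, 4, 1]
```

The enumeration visits all $2^N$ microstates, so it is practical only for small N. The closed formula has no such limitation.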

Table 11.5 gives the values of $\Omega$ calculated by Eq. (11.85) for
N = 24.

Table 11.5

Number of molecules
at left     at right     Ω             Probability
--------------------------------------------------
0           24           1             6 × 10^-8
1           23           24            1.4 × 10^-6
2           22           276           1.6 × 10^-5
3           21           2 024         1.2 × 10^-4
4           20           10 626        6.3 × 10^-4
5           19           42 504        2.5 × 10^-3
6           18           134 596       8 × 10^-3
7           17           346 104       2 × 10^-2
8           16           735 471       4.4 × 10^-2
9           15           1 307 504     7.8 × 10^-2
10          14           1 961 256     0.117
11          13           2 496 144     0.149
12          12           2 704 156     0.161
13          11           2 496 144     0.149
...         ...          ...           ...
23          1            24            1.4 × 10^-6
24          0            1             6 × 10^-8
--------------------------------------------------
Total $2^{24}$ = 16 777 216 ways

The total number of ways of distributing 24 molecules between the
two halves of a vessel is $2^{24}$ = 16 777 216, and only in two cases are
all the molecules concentrated in one of the halves. The probability
of such an event is about $10^{-7}$. Four cubic centimetres of air contain
about $10^{20}$ molecules. The probability of all these molecules gathering
in one of the halves of a vessel is two divided by two raised to
the power $10^{20}$, i.e. about $10^{-3 \times 10^{19}}$. This probability is so small
that we can consider it virtually equal to zero.
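
The two probabilities quoted above can be reproduced with a few lines of code. This is a sketch under stated assumptions: the function names are hypothetical, and for $N = 10^{20}$ the probability is far too small to store as a floating-point number, so only its base-10 logarithm is computed.

```python
from math import comb, log10


def probability_all_in_one_half(n_molecules):
    """Two of the 2**N equally probable microstates empty one half."""
    return 2 / 2**n_molecules


def log10_probability_all_in_one_half(n_molecules):
    """Base-10 logarithm of 2 / 2**N, usable when 2**N is enormous."""
    return (1 - n_molecules) * log10(2)


# Probabilities of all the macrostates for N = 24, as in Table 11.5.
N = 24
probabilities = [comb(N, n) / 2**N for n in range(N + 1)]

print(probability_all_in_one_half(N))             # about 1.2e-7
print(log10_probability_all_in_one_half(10**20))  # about -3.0e19
```

The list `probabilities` holds the full last column of Table 11.5, including the rows that the table omits.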

Figure 11.25 depicts a graph showing how the number of molecules
n in one of the halves of the vessel changes with time. This
number fluctuates about the mean value equal to N/2.

Fig. 11.25

Random deviations of the values of a physical quantity x from its
mean value $\langle x \rangle$ are called fluctuations of this quantity. Denoting
the fluctuation by $\Delta x$, we find that

$$\Delta x = x - \langle x \rangle \qquad (11.86)$$

The arithmetical mean of the quantity (11.86) equals zero. Indeed,

$$\langle \Delta x \rangle = \langle (x - \langle x \rangle) \rangle = \langle x \rangle - \langle x \rangle = 0$$

This is why fluctuations are characterized by the mean square fluctuation
equal to

$$\sqrt{\langle (\Delta x)^2 \rangle} \qquad (11.87)$$

The relative fluctuation of the quantity x is more indicative. It
is determined by the ratio

$$\frac{\sqrt{\langle (\Delta x)^2 \rangle}}{\langle x \rangle} \qquad (11.88)$$

It is proved in statistical physics that the relative fluctuation of
an additive quantity (i.e. a quantity whose value for the body as
a whole equals the sum of the values for its separate parts) is inversely
proportional to the square root of the number of molecules N
forming the body:

$$\frac{\sqrt{\langle (\Delta x)^2 \rangle}}{\langle x \rangle} \propto \frac{1}{\sqrt{N}} \qquad (11.89)$$


Let us calculate the relative fluctuation of the number of molecules 
in the left half of the vessel using the data of Table 11.4. We shall 
perform our calculations by Eq. (11.5). The values of the fluctuations 
and their probabilities P are given below. 


n - N/2     -2      -1      0       +1      +2
P           1/16    4/16    6/16    4/16    1/16

According to these data

$$\langle (n - N/2)^2 \rangle = (-2)^2 \times \frac{1}{16} + (-1)^2 \times \frac{4}{16} + (0)^2 \times \frac{6}{16} + (+1)^2 \times \frac{4}{16} + (+2)^2 \times \frac{1}{16} = 1$$

Hence, the mean square fluctuation equals $\sqrt{1} = 1$, and the relative
fluctuation equals 1/2 (the mean value of n is 2). Similar calculations
performed using the data of Table 11.5 give the value
2.45 for the mean square fluctuation, and 0.204 for the relative
fluctuation. It is easy to see that

$$0.5 : 0.204 = \sqrt{24 : 4} \qquad (11.90)$$

This proportion agrees with Eq. (11.89).
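
The fluctuation values quoted for N = 4 and N = 24 can be recomputed directly from the statistical weights. The sketch below assumes, as the text does, that every microstate is equally probable. The function name is hypothetical.

```python
from math import comb, sqrt


def fluctuation_of_left_count(n_molecules):
    """Return (mean square fluctuation, relative fluctuation) of n."""
    # Probability of the macrostate with n molecules at left: Omega / 2**N.
    probs = [comb(n_molecules, n) / 2**n_molecules
             for n in range(n_molecules + 1)]
    mean = sum(n * p for n, p in enumerate(probs))
    mean_square_deviation = sum((n - mean) ** 2 * p
                                for n, p in enumerate(probs))
    root = sqrt(mean_square_deviation)
    return root, root / mean


rms_4, relative_4 = fluctuation_of_left_count(4)
rms_24, relative_24 = fluctuation_of_left_count(24)
print(rms_4, relative_4)
print(rms_24, relative_24)
print(relative_4 / relative_24, sqrt(24 / 4))
```

The last line prints the two sides of proportion (11.90). Both equal the square root of 6, in agreement with Eq. (11.89).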

Examination of Table 11.5 shows that deviations from the mean 
number of molecules (equal to 12) by not over 2 molecules occur 
with a probability of 0.7, and deviations by not over 3 molecules 
with a probability of 0.85. If the number of molecules could be fractional,
it would be possible for us to say that the gas spends the
majority of its time in states in which the deviations of the number
of molecules from the mean value do not exceed the mean square
fluctuation, i.e. 2.45.

Forming a proportion similar to (11.90) for N = 4 and $N = 10^{20}$,
we get the relative fluctuation $f_r$ of the number of molecules in the
left half of the vessel for the case when $N = 10^{20}$. This proportion
has the form

$$0.5 : f_r = \sqrt{10^{20} : 4}$$

whence $f_r = 10^{-10}$. The result obtained signifies that the value of
the number of molecules in one of the halves of the vessel undergoes
changes that in the main do not exceed unity in the tenth significant
digit.

We have considered the fluctuations of the number of molecules
in one of the halves of a vessel. Other macroscopic characteristics 
such as the pressure and the density of the gas at different points of 
space also fluctuate, i.e. deviate from their mean values. 

A macrostate of a system is an equilibrium one when it has no
tendency to change with time. It is clear that the absence of such
a tendency will be most pronounced in the most probable of all
the macrostates conceivable for the given system. The probability
of a state is proportional to its statistical weight. Therefore, the
equilibrium state can be defined as the state whose statistical
weight is maximum.

A system in an equilibrium state deviates spontaneously from 
equilibrium from time to time. These deviations are insignificant 
and of a short duration, however. The system spends the overwhelm- 
ing part of its time in its equilibrium state characterized by the maxi- 
mum statistical weight. 

Statistical physics reveals the nature of irreversible processes. 
Let us assume that first a gas is in the left half of a vessel separated 
by a partition from the right empty half. If we remove the partition, 
the gas spontaneously spreads out over the entire vessel. This process
will be irreversible because the probability that thermal motion
will gather all the molecules in one of the halves of the vessel
is, as we have seen, virtually nil. Hence, the
gas cannot again concentrate in the left half of the vessel by itself,
without any external action on it.

Thus, the process of the spreading of the gas over the entire vessel 
is irreversible because the reverse process is improbable. This con- 
clusion can be extended to other processes as well. An irreversible 
process is one whose reverse process is extremely improbable. 


11.11. Entropy 

We established in the preceding section that the probability of
a macrostate (in the following we shall call it simply a state) is
proportional to its statistical weight $\Omega$, i.e. to the number of microscopic
ways in which the given macrostate can be achieved. We could
therefore take this number itself, i.e. $\Omega$, as a characteristic of the
probability of the state. Such a characteristic, however, would not
have the property of additivity. To convince ourselves of the truth
of this statement, let us divide a given system into two subsystems
that virtually do not interact. Let these subsystems be in states
with the statistical weights $\Omega_1$ and $\Omega_2$. The number of ways in which
we can achieve the corresponding state of the system equals the
product of the number of ways in which we can achieve the states
of each of the subsystems separately:

$$\Omega = \Omega_1 \Omega_2 \qquad (11.91)$$

This expression shows that $\Omega$ is indeed not an additive quantity.
Taking logarithms of Eq. (11.91), we get

$$\ln \Omega = \ln \Omega_1 + \ln \Omega_2 \qquad (11.92)$$

A glance at Eq. (11.92) shows that $\ln \Omega$ is an additive quantity.
It is much simpler and more convenient to deal with additive quantities.
For this reason, the quantity S proportional to the logarithm
of the statistical weight is taken as a characteristic of the probability
of a state. For a reason which will be explained below, we take the
constant of proportionality equal to the Boltzmann constant k.
The quantity

$$S = k \ln \Omega \qquad (11.93)$$

determined in this way is called the entropy of a system.
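
The contrast between the multiplicative statistical weight and the additive entropy can be made concrete with a small numerical check. The two subsystems used below (4 and 24 molecules divided equally between the halves of a vessel) are an assumption chosen only because their statistical weights appear in Tables 11.4 and 11.5.

```python
from math import comb, isclose, log

K_B = 1.380649e-23  # Boltzmann constant, J/K


def entropy(statistical_weight):
    """S = k ln(Omega), Eq. (11.93)."""
    return K_B * log(statistical_weight)


omega_1 = comb(4, 2)     # 6 microstates, Table 11.4
omega_2 = comb(24, 12)   # 2 704 156 microstates, Table 11.5
omega = omega_1 * omega_2  # Eq. (11.91): the weights multiply

print(entropy(omega), entropy(omega_1) + entropy(omega_2))
```

The two printed values coincide to within rounding error, as Eq. (11.92) requires, whereas the statistical weight of the combined system is not the sum of the two individual weights.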

The properties of the entropy indicated below follow from what 
was said in the preceding section: 

1. The entropy of an isolated system grows when an irreversible 
process occurs in it. Indeed, an isolated system (i.e. one left by it- 
self) passes over from a less probable state to a more probable one, 
and this is attended by a growth of quantity (11.93). 

2. The entropy of a system in its equilibrium state is maximum.

We shall stress once more the not absolutely strict nature of the
above statements. For example, the entropy of a system in an equilibrium
state undergoes insignificant brief negative fluctuations.
The latter are so small, however, that the entropy can virtually be
considered constant and equal to the maximum value.

The statement that the entropy of an isolated system can only grow 
(or remain constant when a maximum value is reached) is known 
as the law of entropy increase or the second law of thermodynamics. 
In other words, we can say that the entropy of an isolated system 
cannot decrease. 

Thus, when an irreversible process occurs in an isolated system, 
the entropy grows, i.e. the following relation is observed: 

$$dS > 0 \qquad (11.94)$$

To see how the entropy of a non-isolated system behaves, let us 
establish the relation between the increment of the entropy dS and 
the amount of heat d'Q supplied to the system. Being a function of 
state, the entropy should be determined by the parameters of state 
of a body (or a system of bodies). An ideal gas has the simplest pro- 
perties. Its equilibrium state is completely determined by setting 
two parameters, for example, its volume V and temperature T. Let 
us try to find the form of the function S = S(V, T) for a monatomic
ideal gas*.

We shall consider a monatomic ideal gas in equilibrium in a vessel
of volume V. External force fields are absent. The number of
molecules in the gas is N and its temperature is T.

The macrostate of the gas is characterized by the values of the
parameters V and T, and its microstate is determined by setting the
coordinates and velocities of all N molecules. The distribution of
the molecules by coordinates and their distribution by velocities
are independent. Therefore, the statistical weight $\Omega$ of a macrostate
can be represented in the form of the product of the factor $\Omega_{sp}$ determining
the number of different arrangements (permutations) of the
molecules in space, and the factor $\Omega_v$ determining the number of
different distributions of the molecules by velocities:

$$\Omega = \Omega_{sp} \Omega_v \qquad (11.95)$$

Indeed, each of the $\Omega_{sp}$ arrangements in space can be realized together
with any of the $\Omega_v$ distributions by velocities. This gives us
Eq. (11.95).

Thus, in the case being considered, the expression for the entropy
has the form

$$S = k \ln \Omega = k \ln \Omega_{sp} + k \ln \Omega_v \qquad (11.96)$$

It can be seen from this equation that the finding of the entropy
of an ideal gas consists in finding the numbers $\Omega_{sp}$ and $\Omega_v$. Having
determined how these numbers depend on the parameters V and T
of a gas, we shall find its entropy as a function of these parameters.

To determine the number $\Omega_{sp}$, let us divide the volume V occupied
by a gas into identical cubic cells. We shall choose the volume of a
cell $\Delta V$ so that the number of cells

$$r = \frac{V}{\Delta V} \qquad (11.97)$$

is much smaller than the number of molecules N ($r \ll N$). Hence,
many molecules will get into each cell on the average. We shall see
below that the size of the cells (except for the condition $r \ll N$)
has no appreciable influence on the expression for the entropy.

Let us consider a macrostate characterized in that the first cell
contains $n_1$ molecules, the second cell $n_2$ molecules, ..., the r-th
cell $n_r$ molecules ($\sum n_i = N$). We shall find the number of ways
(i.e. the number of microstates) in which such a macrostate can be
realized. For this purpose, we shall fix “sites” inside the cells at
which we shall “place” the molecules in distributing them among the
cells (in Fig. 11.26 these sites are designated by dots).


* The following derivation was proposed by N. B. Narozhny.






The molecules can be arranged at the sites depicted in Fig. 11.26
in $N!$ ways ($N!$ is the number of permutations of N molecules arranged
at N sites at a time). Permutations, however, in which the only
change is the order of arrangement of the molecules at the $n_1$ sites
of the 1st cell (the number of these permutations is $n_1!$), or at the $n_2$
sites of the 2nd cell (the number of these permutations is $n_2!$), etc.,
do not result in a new microstate. We remind our reader that separate
microstates differ only in the numbers of the molecules getting
into different cells. Let us fix the numbers of the $n_1$ molecules in
the first cell. The number of different permutations of the molecules
in this cell corresponding to each of the possible distributions of the
remaining molecules among the other cells is $n_1!$. Hence, dividing
the total number of permutations $N!$ by $n_1!$, we eliminate from our
consideration the permutations differing only in the way of arrangement
of the molecules in the first cell. Next dividing $N!/n_1!$ by $n_2!$,
we exclude from our consideration the permutations differing only
in the way of arranging the molecules in the second cell. Continuing
this process, we arrive at the equation

$$\Omega_{sp} = \frac{N!}{n_1!\, n_2! \cdots n_r!} \qquad (11.98)$$

that gives us the number of permutations of the molecules by cells
differing only in the numbers of the molecules in different cells
[compare with Eq. (11.85)]. This number is the “space” part of the
statistical weight.

Fig. 11.26 (sites numbered 1, 2, ..., $n_1$ in the 1st cell; 1, 2, ..., $n_2$ in the 2nd cell; ...; 1, 2, ..., $n_r$ in the r-th cell)

Since we have assumed that an external force field is absent, in 
the equilibrium state the molecules are distributed over the volume 
with a constant density. Hence, the numbers $n_1$, $n_2$, ..., $n_r$ are
identical on the average and equal n = N/r (r is the number of cells).
Thus, for the equilibrium state, the “space” part of the statistical
weight is

$$\Omega_{sp} = \frac{N!}{(n!)^r}$$

Taking logarithms, we obtain

$$\ln \Omega_{sp} = \ln N! - r \ln n! \qquad (11.99)$$

According to Stirling’s formula (see Appendix 3), we have

$$\ln N! \approx N \ln N - N \qquad (11.100)$$

We shall use this formula to transform Eq. (11.99) as follows:

$$\ln \Omega_{sp} \approx N \ln N - N - r (n \ln n - n) = N \ln N - N \ln n = N \ln \frac{N}{n}$$

(we have taken into account that rn = N). The ratio N/n equals
$V/\Delta V$. Hence,

$$\ln \Omega_{sp} = N \ln \frac{V}{\Delta V} = N \ln V - N \ln \Delta V \qquad (11.101)$$
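
How well the Stirling approximation works in Eq. (11.101) can be judged numerically by comparing it with Eq. (11.99) evaluated through the logarithm of the gamma function. The sketch below assumes that N is exactly divisible by the number of cells r. The function names and the sample values $N = 10^6$, r = 100 are hypothetical.

```python
from math import lgamma, log


def ln_omega_space_exact(n_total, n_cells):
    """ln(N!) - r ln(n!), Eq. (11.99), using ln(x!) = lgamma(x + 1)."""
    per_cell = n_total // n_cells  # n = N/r, assumed to be an integer
    return lgamma(n_total + 1) - n_cells * lgamma(per_cell + 1)


def ln_omega_space_stirling(n_total, n_cells):
    """N ln(N/n) = N ln(r), which is Eq. (11.101) with r = V/dV."""
    return n_total * log(n_cells)


N, r = 10**6, 100
exact = ln_omega_space_exact(N, r)
approx = ln_omega_space_stirling(N, r)
print(exact, approx, abs(exact - approx) / exact)
```

For these values the relative difference is of the order of one part in ten thousand, and it decreases further as the number of molecules per cell grows. This is why the condition $r \ll N$ matters.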

Let us now pass over to finding $\Omega_v$. We shall introduce a space
along whose axes the components of the molecule velocities are
laid off (a v-space). Let us divide this space into identical cubic
cells of volume $\Delta A$. We shall see below that the value of $\Delta A$, like
that of $\Delta V$, is of no significance; it is only important that the
volume $\Delta A$ be sufficiently great for many molecules to “get” into it.

In the equilibrium state, the density $\rho$ of the points depicting the
velocities of the molecules is determined by a Maxwell distribution
function [see Eqs. (11.40), (11.53), and (11.63)]:

$$\rho = N f(v_x, v_y, v_z) = N A^3 \exp\left[-\frac{m (v_x^2 + v_y^2 + v_z^2)}{2kT}\right] = N \left(\frac{m}{2\pi kT}\right)^{3/2} \exp\left(-\frac{mv^2}{2kT}\right)$$

Denoting the velocity corresponding to the i-th cell by $v_i$, we get
the following value for the “density of the molecules” in the i-th
cell:

$$\rho_i = N \left(\frac{m}{2\pi kT}\right)^{3/2} \exp\left(-\frac{mv_i^2}{2kT}\right)$$

Finally, multiplying the density $\rho_i$ by the volume of a cell $\Delta A$, we
get the number of molecules $n_i$ entering the i-th cell:

$$n_i = N \left(\frac{m}{2\pi kT}\right)^{3/2} \exp\left(-\frac{mv_i^2}{2kT}\right) \Delta A \qquad (11.102)$$


By analogy with Eq. (11.98), we conclude that the number of ways
in which we can distribute the molecules among the cells with the
given values of the numbers $n_i$ is

$$\Omega_v = \frac{N!}{n_1!\, n_2! \cdots n_i! \cdots} \qquad (11.103)$$

Unlike Eq. (11.98), the number of cells is now infinitely great. For
cells sufficiently remote from the origin of coordinates, however,
the numbers $n_i$ virtually equal zero.

Taking logarithms of Eq. (11.103) yields

$$\ln \Omega_v = \ln N! - \sum_i \ln n_i!$$

Using expression (11.100), we get
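
$$\ln \Omega_v \approx N \ln N - N - \sum_i (n_i \ln n_i - n_i) = N \ln N - \sum_i n_i \ln n_i \qquad (11.104)$$

(here $\sum_i n_i = N$).

According to Eq. (11.102), we have

$$\ln n_i = \ln N + \ln \Delta A + \frac{3}{2} \ln \frac{m}{2\pi k} - \frac{3}{2} \ln T - \frac{mv_i^2}{2kT}$$

Introducing this equation into expression (11.104), we get

$$\ln \Omega_v = N \ln N - \left(\ln N + \ln \Delta A + \frac{3}{2} \ln \frac{m}{2\pi k} - \frac{3}{2} \ln T\right) \sum_i n_i + \frac{1}{kT} \sum_i n_i \frac{mv_i^2}{2} \qquad (11.105)$$

The expression $\sum_i n_i \frac{mv_i^2}{2}$ is equivalent to $N \left\langle \frac{mv^2}{2} \right\rangle = N \frac{3}{2} kT$,
and the sum $\sum_i n_i$ equals N. Taking this into account, we shall rewrite
Eq. (11.105) as follows:

$$\ln \Omega_v = N \ln N - N \ln N - N \ln \Delta A - \frac{3}{2} N \ln \frac{m}{2\pi k} + \frac{3}{2} N \ln T + \frac{1}{kT} N \frac{3}{2} kT$$

$$= \frac{3}{2} N \ln T - N \ln \Delta A + \frac{3}{2} N \left[1 - \ln \frac{m}{2\pi k}\right] = \frac{3}{2} N \ln T - N \ln \Delta A + \frac{3}{2} N \alpha \qquad (11.106)$$

Here $\alpha$ stands for the expression in brackets that contains no parameters
of state of a gas.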

Assuming in Eqs. (11.101) and (11.106) that N equals the Avogadro
constant $N_A$ and then introducing these equations into
Eq. (11.96), we arrive at a formula for the entropy of one mole of
a monatomic ideal gas:

$$S_m = k N_A \ln V - k N_A \ln \Delta V + \frac{3}{2} k N_A \ln T - k N_A \ln \Delta A + \frac{3}{2} k N_A \alpha$$

The product $k N_A$ equals the molar gas constant R. Consequently,

$$S_m = R \ln V + \frac{3}{2} R \ln T - R \ln (\Delta V\, \Delta A) + \frac{3}{2} R \alpha$$

Introducing the notation

$$S_0 = -R \ln (\Delta V\, \Delta A) + \frac{3}{2} R \alpha \qquad (11.107)$$

and taking into account that $\frac{3}{2} R$ is the molar heat capacity of
a monatomic gas at constant volume $C_V$, we get the final formula

$$S_m = R \ln V + C_V \ln T + S_0 \qquad (11.108)$$

This formula determines the molar entropy of a monatomic ideal
gas* as a function of the parameters of state V and T. Using an
equation of state, we can pass over to an expression for the entropy
through other parameters, for instance, through V and p.

It can be seen from Eq. (11.107) that the choice of the size of the
cells $\Delta V$ and $\Delta A$ only affects the value of the additive constant $S_0$,
and the entropy is determined by Eq. (11.108) with an accuracy up
to this quantity.

When the heat d'Q is supplied to a gas, either T changes (at a constant
V), or V changes (at a constant T), or both parameters T and
V change. The entropy changes accordingly. To relate this change
to d'Q, let us find the differential of Eq. (11.108) and multiply it
by T. The result is

$$T\, dS_m = \frac{RT}{V_m}\, dV_m + C_V\, dT$$

(to stress that we have in view a mole of the gas, we have used the
subscript “m” with V).

The addend $C_V\, dT$ gives the increment of the internal energy of
a gas $dU_m$. Assuming that the process of supplying the heat d'Q
is reversible, we can represent the addend $(RT/V_m)\, dV_m$ in the form
$p\, dV_m = d'A$. We thus arrive at the equation

$$T\, dS_m = p\, dV_m + dU_m$$

Owing to the additivity of S, V, and U, a similar equation holds
for an arbitrary mass of a gas:

$$T\, dS = p\, dV + dU = d'A + dU$$

According to the first law of thermodynamics, the right-hand side
of this equation equals d'Q. Therefore,

$$T\, dS = d'Q$$

Hence,

$$dS_{\text{m.id}} = \frac{d'Q}{T} \quad \text{(reversible process)} \qquad (11.109)$$

(the subscript “m.id” signifies “monatomic ideal gas”).
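
Equation (11.109) can be tested numerically: summing d'Q/T over many small reversible steps should reproduce the entropy difference given by Eq. (11.108). The sketch below assumes one mole of a monatomic ideal gas taken along a straight line in the (V, T) plane between two arbitrarily chosen states. The function names and the two states are hypothetical.

```python
from math import isclose, log

R = 8.314      # molar gas constant, J/(mol K)
C_V = 1.5 * R  # molar heat capacity of a monatomic gas at constant volume


def molar_entropy(volume, temperature, s0=0.0):
    """S_m = R ln V + C_V ln T + S_0, Eq. (11.108)."""
    return R * log(volume) + C_V * log(temperature) + s0


def integrate_heat_over_temperature(v1, t1, v2, t2, steps=10_000):
    """Sum d'Q/T along a straight line from (v1, t1) to (v2, t2)."""
    dv = (v2 - v1) / steps
    dt = (t2 - t1) / steps
    total = 0.0
    for i in range(steps):
        # State at the middle of the i-th small step.
        v = v1 + (i + 0.5) * dv
        t = t1 + (i + 0.5) * dt
        heat = C_V * dt + (R * t / v) * dv  # d'Q = dU + p dV
        total += heat / t
    return total


state_1 = (0.0224, 273.0)  # volume in m^3, temperature in K
state_2 = (0.0448, 373.0)
delta_s = molar_entropy(*state_2) - molar_entropy(*state_1)
summed = integrate_heat_over_temperature(*state_1, *state_2)
print(delta_s, summed)
```

The additive constant $S_0$ cancels in the difference, so the comparison does not depend on the choice of the cell sizes.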

We have obtained Eq. (11.109) when considering a monatomic 
ideal gas. It is simple to extend it, however, to any thermodynamic 

* We shall show in Sec. 12 4 that Eq. (11.108) also holds for an ideal gas 
with polyatomic molecules. 




system. Assume that we have an isolated system in an equilibrium 
state whose composition, in addition to a monatomic ideal gas, 
includes other bodies whose combination we shall call a subsystem. 
All parts of the system have the same temperature (otherwise the 
state of the system will not be an equilibrium one). Owing to addi- 
tivity, the entropy of the system S sy st can be written in the form 

Ssyst = Ssub ^ m • Id 

where $S_{sub}$ is the entropy of the subsystem, and $S_{m.id}$ is the entropy
of a monatomic ideal gas. Assume that the temperature of the gas
experienced an infinitely small fluctuation dT. As a result, the gas
will obtain the amount of heat $d'Q_{m.id}$ from the subsystem. The
latter will receive the heat $d'Q_{sub} = -d'Q_{m.id}$. Owing to the
smallness of dT, this process can be considered as reversible. Consequently,
the entropy of the gas will receive the increment $dS_{m.id} = d'Q_{m.id}/T$.

When a reversible process occurs in an isolated system, the entropy 
of the system remains constant. It thus follows that 

$$dS_{syst} = dS_{sub} + dS_{m.id} = 0$$

Taking into account the value of $dS_{m.id}$, we shall get the following
expression for the increment of the entropy of the subsystem:

$$dS_{sub} = -dS_{m.id} = -\frac{d'Q_{m.id}}{T} = \frac{d'Q_{sub}}{T}$$

Thus, for a combination of arbitrary bodies too, the formula

$$dS = \frac{d'Q}{T} \quad \text{(reversible process)} \qquad (11.110)$$

holds. Here d'Q is the amount of heat received by the system in a re- 
versible process, and T is the temperature of the system. 

We must note that whereas d'Q is not a total differential,
the ratio d'Q/T in Eq. (11.110) is a total differential (the entropy
is a function of state).

Now we are in a position to explain why we took the Boltzmann
constant k as the constant of proportionality in Eq. (11.93). This
resulted in the proportionality constant between dS and d'Q/T
being equal to unity [see Eq. (11.110)].

A state achieved in a relatively small number of ways is called 
ordered or not random. A state achieved in many different ways 
is called disordered or random. The entropy is thus a quantitative 
measure of the degree of molecular disorder in a system. This cir- 
cumstance makes it possible to understand the meaning of 
Eq. (11.110). The supply of heat to a system results in greater ther- 
mal motion of the molecules and, consequently, in an increase in 
the degree of disorder in the system. The higher the temperature,
i.e. the greater the internal energy of the system, the relatively
smaller is the fraction of the disorder due to the supply of the given
amount of heat d'Q.

The reversibility of the process in the course of which the heat
d'Q is supplied to the system is an important condition for Eq. (11.110)
to hold. If the amount of heat d'Q is imparted to the system in the
course of an irreversible process, the entropy grows both as a result
of supplying heat and as a result of the irreversibility of the process.
We therefore have the inequality

$$dS > \frac{d'Q}{T} \quad \text{(irreversible process)} \qquad (11.111)$$

When d'Q vanishes, this inequality transforms into expression 
(11.94). By T in formula (11.111), we mean the temperature of the 
reservoir from which the given system receives the heat d'Q . The 
temperature of the system in an irreversible process may not have 
a definite value because the state of the system is not an equi- 
librium one. 

We can combine Eq. (11.110) and formula (11.111) and write

$$dS \geq \frac{d'Q}{T} \qquad (11.112)$$

The equality sign relates to reversible processes, and the inequality
sign to irreversible ones.
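A short numerical illustration of the inequality may help. The Python sketch below assumes a body of constant heat capacity C that is heated irreversibly by a single reservoir kept at the final temperature. The entropy increment of the body is found along a reversible path, and the quantity Q/T uses the reservoir temperature, as formula (11.111) requires. The numbers (roughly 1 kg of water) and the function names are hypothetical.

```python
import math

def entropy_increment(C, T_start, T_end):
    """Entropy change of a body with constant heat capacity C, in J/K."""
    return C * math.log(T_end / T_start)

def heat_over_reservoir_temperature(C, T_start, T_end, T_reservoir):
    """Q / T_reservoir for heat Q = C (T_end - T_start) taken from one reservoir."""
    return C * (T_end - T_start) / T_reservoir

C = 4200.0  # J/K, assumed constant
dS = entropy_increment(C, 293.0, 353.0)
q_over_T = heat_over_reservoir_temperature(C, 293.0, 353.0, T_reservoir=353.0)
print(dS, q_over_T)  # dS exceeds Q/T for this irreversible heating
```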

Expression (11.112) is the foundation for thermodynamic ap- 
plications of the concept of entropy. These applications will be dealt 
with in the following chapter. 

At absolute zero, any body, as a rule*, is in its ground state, whose
statistical weight equals unity ($\Omega = 1$). Equation (11.93) gives
a value of zero for the entropy in this case. It thus follows that when
the temperature of a body tends to absolute zero, its entropy tends to
zero:

$$\lim_{T \to 0} S = 0 \qquad (11.113)$$

This statement is the content of Nernst’s theorem. It is sometimes 
called the third law of thermodynamics. 


* There are exceptions to this rule which we shall not discuss. 




CHAPTER 12 THERMODYNAMICS 


12.1. Fundamental Laws 

Thermodynamics originated as a science on the conversion of heat 
into work. The laws on which thermodynamics is based, however, 
have such a general nature that thermodynamic methods at present 
are being used with great success to study numerous physical and 
chemical processes, as well as the properties of a substance and radia- 
tion. As we have already noted in Sec. 10.1, thermodynamics does 
not consider the microscopic picture of phenomena in studying the 
properties and processes of the transformation of a substance. It 
treats phenomena on the basis of fundamental laws extracted from 
experiments. For this reason, the conclusions which thermodyna- 
mics arrives at have the same degree of authenticity as the laws it 
is based on. The latter, in turn, are a generalization of an enormous 
amount of experimental data. 

Two laws form the foundation of thermodynamics. The first of them 
establishes the quantitative relations attending the conversions of 
energy from kind to kind. The second law determines the conditions 
in which these conversions are possible, i.e. it determines the pos- 
sible directions of processes. 

The first law of thermodynamics states that the heat supplied to
a system is spent on an increment in the internal energy of the system
and on work done by the system on external bodies:

$$Q = U_2 - U_1 + A \qquad (12.1)$$

or in the differential form:

$$d'Q = dU + d'A \qquad (12.2)$$

[see Eqs. (10.7) and (10.9)].

The first law is sometimes worded as follows: it is impossible to
have a perpetual motion machine (perpetuum mobile) of the first kind,
i.e. such a periodically functioning machine that would do work in
a greater amount than the energy it receives from its surroundings.

Any machine or engine is a system repeatedly performing a cyclic
process (a cycle). Assume that in the course of a cycle the working
substance (for example, a gas) first expands to the volume $V_2$, and
then is again compressed to its initial volume $V_1$ (Fig. 12.1). For
the work during a cycle to be greater than zero, the pressure (and,
consequently, also the temperature) in the expansion process should
be greater than in compression. For this purpose, heat must be supplied
to the working substance in expansion, and heat must be removed
from it in compression.

The working substance returns to its initial state upon completing
a cycle. Therefore, the change in the internal energy during
a cycle equals zero. The amount of heat supplied to the working
substance during a cycle is $Q_1 - Q'_2$, where $Q_1$ is the heat received
by the working substance in expansion, and $Q'_2$ is the heat given up in
compression. The work A done during a cycle equals the area of the
cycle (see Sec. 10.6). Equation (12.1) written for a cycle thus has the
form

$$A = Q_1 - Q'_2 \qquad (12.3)$$

A periodically functioning machine doing work at the expense
of heat received from an external source is called a heat engine.
We can see from Eq. (12.3) that not all the heat $Q_1$ received from an
external source is used to obtain useful work. For a machine or
engine to operate in cycles, part of the heat equal to $Q'_2$ must be
returned to the surroundings and is therefore not used for its direct
purpose (i.e. for doing useful work). It is clear that the more
completely a heat engine converts the heat received from a source into
useful work A, the more profitable is this engine. It is therefore
customary practice to characterize a heat engine by its efficiency $\eta$
determined as the ratio of the work A done during a cycle to the
heat $Q_1$ received during the cycle:

$$\eta = \frac{A}{Q_1} \qquad (12.4)$$

In view of Eq. (12.3), the expression for the efficiency can be
written in the form

$$\eta = \frac{Q_1 - Q'_2}{Q_1} \qquad (12.5)$$

It follows from the definition of the efficiency that it cannot be greater 
than unity. 
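As a small worked example of Eqs. (12.3) to (12.5), the Python sketch below computes the work and the efficiency of a cycle from the heat received and the heat given up. The function names and the sample values of heat are illustrative assumptions.

```python
def work_per_cycle(q1, q2_out):
    """Work done during a cycle, A = Q1 - Q2' (Eq. 12.3)."""
    return q1 - q2_out

def efficiency(q1, q2_out):
    """Efficiency of a heat engine, eta = (Q1 - Q2') / Q1 (Eq. 12.5)."""
    if q1 <= 0:
        raise ValueError("the heat received per cycle must be positive")
    return work_per_cycle(q1, q2_out) / q1

A = work_per_cycle(1000.0, 600.0)   # joules per cycle
eta = efficiency(1000.0, 600.0)
print(A, eta)
```

Since the heat given up cannot be negative, the value returned never exceeds unity.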

If we reverse the cycle shown in Fig. 12.1, we shall get a cycle of
a refrigerating machine. Such a machine removes the heat $Q_2$ from
a substance at the temperature $T_2$ and gives up the heat $Q'_1$ to a
substance with a higher temperature $T_1$. The work A' must be done on
the machine during a cycle. The effectiveness of a refrigerating
machine is characterized by its refrigerating factor (or coefficient
of performance) p. The latter is defined as the ratio of the heat $Q_2$
removed from a body being cooled to the work A' spent to actuate
the machine:

$$p = \frac{Q_2}{A'} = \frac{Q_2}{Q'_1 - Q_2} \qquad (12.6)$$
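The definition in Eq. (12.6) can be illustrated in the same way. The Python sketch below is a minimal example; the function name and the sample heats are assumptions chosen for illustration.

```python
def refrigerating_factor(q2, q1_out):
    """Ratio of the heat Q2 removed from the cooled body to the work A' = Q1' - Q2."""
    work = q1_out - q2
    if work <= 0:
        raise ValueError("the work done on the machine must be positive")
    return q2 / work

factor = refrigerating_factor(q2=300.0, q1_out=400.0)
print(factor)  # heat removed per unit of work spent
```

Unlike the efficiency of a heat engine, this ratio may exceed unity.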


The second law of thermodynamics, like the first one, can be
formulated in several ways. We have acquainted ourselves with one
of them in Sec. 11.11. It is the statement that the entropy of an
isolated system cannot diminish:

$$dS \geq 0 \qquad (12.7)$$

The German physicist Rudolf Clausius (1822-1888) stated the 
second law as follows: processes are impossible whose only final result 
would be the flow of heat from a colder body to a warmer one . Matters 
must not be understood in such a way that the second law in general 
prohibits the transfer of heat from a colder body to a warmer one. 
It is exactly such a transfer that is performed in a refrigerating ma- 
chine. This transfer, however, is not the only result of a process. It is 
attended by changes in the surrounding bodies associated with the 
performance of the work A' on the system. 

We shall show that an imaginary process performed in an isolated 
system and contradicting the second law as worded by Clausius 
is attended by a decrease in the entropy. We shall thus prove the 
equivalence of Clausius’s statement and of the statistical statement 
of the second law according to which the entropy of an isolated system 
cannot diminish. 

We shall first make the following remark. Assume that a body 
exchanges heat with another body, which we shall call a heat source 
or heat reservoir. Let the heat capacity of the reservoir be infinitely 
great. This signifies that when the reservoir receives or gives up a 
finite amount of heat, its temperature does not change. A process 
occurring in a body and attended by the exchange of heat with 
a reservoir can be reversible only if in the course of this process the 
temperature of the body will equal that of the corresponding
reservoir. Indeed, if, for example, a body receives heat from a reservoir
having the temperature $T_1$ while having a temperature lower than
$T_1$, then when the same process is reversed the body can return to
the reservoir the heat received from it only if its temperature
is at any rate not lower than $T_1$. Consequently, in the forward and the reverse
course of the process, the temperature of the body will differ, the 
body will pass in both cases through different sequences of states 
(characterized by different temperatures), and the process being 
considered will be irreversible. 

Thus, a process attended by heat exchange can be reversible only 
if upon receiving heat and returning it in the reverse stroke to the 
reservoir, the body has the same temperature equal to that of the 
reservoir. Strictly speaking, when receiving heat, the temperature of
the body must be lower than that of the reservoir by an infinitely small
value (otherwise no heat will flow from the reservoir to the body), 
and when giving up heat, the temperature of the body must be 
higher than that of the reservoir by an infinitely small value. 

Consequently, the only reversible process attended by heat ex- 
change with a reservoir whose temperature remains constant is an 
isothermal process at the temperature of the reservoir. 

Let us consider an isolated system consisting of two bodies of the
same heat capacity C. Assume that body B transfers the heat Q to
body A, and as a result the temperature of A rises from $T_{A,0}$ to $T_A$,
while the temperature of B lowers from $T_{B,0}$ to $T_B$ (here
$T_B < T_{B,0} < T_{A,0} < T_A$). Such a process contradicts the second law
formulated by Clausius. Let us find the change in the entropy in this case.

In the course of this process, heat exchange occurs between bodies 
with different temperatures. In view of what has been said above, 
such a process is irreversible. Equation (11.110), however, may be 
applied only to reversible processes. To find the change in the entro- 
py in an irreversible process, we proceed as follows. We consider 
a reversible process that brings the system to the same final state 
as the given irreversible process, and calculate the change in the 
entropy for this process by the equation

$$S_2 - S_1 = \int_1^2 \frac{d'Q}{T} \qquad (12.8)$$

[see Eq. (11.110)].

In accordance with what has been said above, we shall consider
a reversible process in the course of which body B gives up the
heat Q in portions of d'Q to a consecutive series of reservoirs with
temperatures having all the values from $T_{B,0}$ to $T_B$, and body A
receives the heat Q in portions of d'Q from a number of reservoirs with
temperatures from $T_{A,0}$ to $T_A$. As a result, the system will pass
reversibly from the state in which the bodies have the temperatures
$T_{A,0}$ and $T_{B,0}$ to the state in which the temperatures of the
bodies are $T_A$ and $T_B$. The increment of the entropy in the course
of this process is

$$\Delta S = \Delta S_A + \Delta S_B = \int_{T_{A,0}}^{T_A} \frac{C\,dT}{T} + \int_{T_{B,0}}^{T_B} \frac{C\,dT}{T} = C \ln\frac{T_A}{T_{A,0}} + C \ln\frac{T_B}{T_{B,0}} = C \ln\frac{T_A T_B}{T_{A,0} T_{B,0}}$$

Taking into account that $T_A = T_{A,0} + a$ and $T_B = T_{B,0} - a$
(here $a = Q/C > 0$), we can write $\Delta S$ in the form

$$\Delta S = C \ln\frac{(T_{A,0} + a)(T_{B,0} - a)}{T_{A,0} T_{B,0}} = C \ln\left[1 - \frac{a\,(T_{A,0} - T_{B,0})}{T_{A,0} T_{B,0}} - \frac{a^2}{T_{A,0} T_{B,0}}\right]$$

Since $T_{A,0} > T_{B,0}$, the expression in brackets is less than unity,
and, consequently, $\Delta S < 0$. We have thus shown that in the course
of an imaginary process contradicting the second law as stated by
Clausius, the entropy diminishes, which contradicts the law of
non-diminishing of the entropy.
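The sign of the entropy change can be confirmed with a few numbers. The Python sketch below evaluates the formula for two bodies of equal heat capacity C, first for the forbidden direction (the colder body B heats the warmer body A) and then for the natural direction. The function name and the sample temperatures are illustrative assumptions.

```python
import math

def entropy_change(C, T_A0, T_B0, Q):
    """Entropy change of the pair when body B gives the heat Q to body A."""
    a = Q / C                       # temperature shift of each body
    T_A, T_B = T_A0 + a, T_B0 - a   # final temperatures
    return C * math.log(T_A * T_B / (T_A0 * T_B0))

# Body B (300 K) is colder than body A (400 K): forbidden by Clausius.
forbidden = entropy_change(C=100.0, T_A0=400.0, T_B0=300.0, Q=1000.0)
# Body B (400 K) is warmer than body A (300 K): the natural direction.
allowed = entropy_change(C=100.0, T_A0=300.0, T_B0=400.0, Q=1000.0)
print(forbidden, allowed)  # negative for the forbidden case, positive otherwise
```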

The British scientist Lord Kelvin (William Thomson, 1824-1907)
proposed still another statement of the second law of thermodynamics.
It is worded as follows: such processes are impossible
whose only final result would be the removal of a definite amount of
heat from a body and the complete conversion of this heat into work.

It may seem on the face of it that this statement contradicts,
for example, the process of isothermal expansion of an ideal gas.
Indeed, all the heat received by an ideal gas from a body is com- 
pletely converted into work. The reception of heat and its conversion 
into work are not the only final result of the process, however; as 
a result of the process the volume of the gas changes. 

In a heat engine, the conversion of heat into work is attended
without fail by an additional process: the transfer of a certain
amount of heat $Q'_2$ to the colder body. Hence, the heat $Q_1$ received
from the warmer body cannot be completely converted into work.

It is easy to see that Kelvin’s statement logically follows from 
that of Clausius. Indeed, work can be completely transformed into 
heat, for example, in friction. Therefore, by using a process for- 
bidden by Kelvin’s statement to convert the heat removed from 
a body completely into work, and then transforming this work by 
friction into heat transferred to another body with a higher temper- 
ature, we would carry out a process that is impossible according 
to Clausius’s statement. 

By using processes forbidden by the second law of thermodyna- 
mics, we could create an engine doing work at the expense of the 
heat received from such a virtually inexhaustible source of energy, 
for example, as the ocean. In practice, such an engine would be equiv- 
alent to a perpetual motion machine. This is why the second law 
of thermodynamics is sometimes stated as follows: a perpetual motion 
machine of the second kind is impossible, i.e. such a periodically operat- 
ing engine that would receive heat from a single reservoir and completely 
convert this heat into work. 


12.2. The Carnot Cycle 

It can be seen from the preceding section that the presence of
two heat reservoirs is needed for the operation of a heat engine.
In the course of a cycle, the engine receives the heat $Q_1$ from one
of them having the higher temperature $T_1$ and called the high
temperature reservoir (or heater, or heat source). The engine gives up
the heat $Q'_2$ to the second one having the lower temperature $T_2$
and called the low temperature reservoir (or cooler, or heat sink).

Let us assume that the heat capacity of the reservoirs is infinitely 
great. This signifies that when the reservoirs give up or receive a finite 
amount of heat, their temperatures do not change. Let us see what 
reversible cycle can be performed by the working substance of the 
engine in these conditions. For brevity’s sake, we shall call the work- 
ing substance of the engine simply the substance. 

The cycle being considered can evidently consist both of processes
during which the substance exchanges heat with the reservoirs, and
of processes not attended by heat exchange with the surroundings,
i.e. adiabatic processes. We established in the preceding section
that the only reversible process attended by heat exchange with
a reservoir whose temperature remains constant is an isothermal
process going on at the temperature of the reservoir.

[Fig. 12.2] [Fig. 12.3]

We thus arrive at the conclusion that a reversible cycle performed
by a substance exchanging heat with two heat reservoirs of infinitely
great capacity can consist only of two isotherms (at the temperatures
of the reservoirs) and two adiabats. Such a cycle was first proposed
for consideration by the French engineer Sadi Carnot (1796-1832)
and is called the Carnot cycle. It must be noted that the Carnot
cycle is reversible by definition.

In an adiabatic process, d'Q = 0. Hence, according to Eq. (11.110), 
in a reversible adiabatic process dS = 0 and, consequently, the 
entropy remains constant. This is why a reversible adiabatic process 
is called isentropic. Using this term, we can say that a Carnot cycle 
consists of two isotherms and two isentropes. In a T-S diagram, 
this cycle appears as shown in Fig. 12.2. We must note that the shape 
of a Carnot cycle in a T-S diagram does not depend on the proper- 
ties of the substance (or system of substances) for which it is depicted. 

Figure 12.3 shows a process that transfers a system from state 1
to state 2. According to Eq. (11.110), the elementary amount of heat
d'Q received by the system can be represented in the form T dS.
Hence, the area of the hatched strip in Fig. 12.3 equals d'Q, and
the area of the figure confined by curve 1-2 gives the amount of heat
received by the system in the course of the process. Similarly, the
area of the figure confined by the curve depicting the process in a
p-V diagram gives the work done by the system in the course of
the process (see Fig. 10.4).

In accordance with the above, the area of the cycle in Fig. 12.2
gives the heat received by the system in the course of the cycle (it
equals $Q_1 - Q'_2$). Similarly, the area of the cycle in a p-V diagram
gives the work done by the system during the cycle (see Fig. 10.5).

The heat received by a system in the course of an arbitrary
reversible process can be calculated by the formula

$$Q = \int_1^2 T\,dS \qquad (12.9)$$

[compare with Eq. (10.12)].
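The interpretation of heat as an area in a T-S diagram can be sketched numerically. The Python example below estimates the integral of T dS by the trapezoidal rule along a sampled path and applies it to the two isotherms of a Carnot cycle. The function name, the sample temperatures and the entropy values are assumptions made for illustration.

```python
def heat_received(entropies, temperatures):
    """Trapezoidal estimate of Q = integral of T dS along a sampled reversible path."""
    total = 0.0
    for i in range(1, len(entropies)):
        dS = entropies[i] - entropies[i - 1]
        total += 0.5 * (temperatures[i] + temperatures[i - 1]) * dS
    return total

# Isotherm 1-2 at 500 K: the entropy grows from 1 to 3 J/K.
q_12 = heat_received([1.0, 2.0, 3.0], [500.0, 500.0, 500.0])
# Isotherm 3-4 at 300 K: the entropy returns from 3 to 1 J/K.
q_34 = heat_received([3.0, 2.0, 1.0], [300.0, 300.0, 300.0])
net = q_12 + q_34   # the adiabats contribute nothing because dS = 0 on them
print(q_12, q_34, net)
```

The net value equals the area of the rectangle, (T1 - T2) times the entropy change along an isotherm.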

Let us find the efficiency of a Carnot cycle. Upon completing the
cycle, the system returns to its initial state. Hence, the total change
in the entropy during the cycle is zero. Along path 1-2 (see Fig. 12.2),
the system receives the heat $Q_1$ from the reservoir with the
temperature $T_1$. The entropy increment on this path is

$$\Delta S_{12} = \int_1^2 \frac{d'Q}{T_1} = \frac{Q_1}{T_1}$$

Along path 3-4, the system gives up the heat $Q'_2$ to the reservoir
with the temperature $T_2$. The removal of the heat $Q'_2$ from a
substance is equivalent to supplying the heat $-Q'_2$ to it. Therefore,
the entropy increment along path 3-4 is

$$\Delta S_{34} = \int_3^4 \frac{d'Q}{T_2} = \frac{-Q'_2}{T_2}$$

The entropy is constant along 2-3 and 4-1. Thus, the total entropy
increment during the cycle is

$$\Delta S_{12} + \Delta S_{34} = \frac{Q_1}{T_1} - \frac{Q'_2}{T_2} = 0 \qquad (12.10)$$

It can be seen from Eq. (12.10) that

$$\frac{Q'_2}{Q_1} = \frac{T_2}{T_1} \qquad (12.11)$$

Equation (12.5) for the efficiency of a heat engine can be written
in the form

$$\eta = 1 - \frac{Q'_2}{Q_1} \qquad (12.12)$$

Substituting in this equation for $Q'_2/Q_1$ its value from Eq. (12.11),
we get

$$\eta = 1 - \frac{T_2}{T_1} = \frac{T_1 - T_2}{T_1} \qquad (12.13)$$

In deriving Eq. (12.13), we made no assumptions on the properties
of the working substance and the design of the heat engine.
We thus arrive at the statement that the efficiency of all reversible
machines operating in identical conditions (i.e. at the same temperatures
of the high temperature and low temperature reservoirs) is the same
and is determined only by the temperatures of the two reservoirs. This
statement is known as the Carnot theorem.
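Equation (12.13) is easy to evaluate for given reservoir temperatures. The Python sketch below is a minimal example; the function name and the sample temperatures are illustrative assumptions, and the temperatures must be absolute (in kelvins).

```python
def carnot_efficiency(T_hot, T_cold):
    """Efficiency of a reversible machine, eta = 1 - T2/T1 (Eq. 12.13)."""
    if not (T_hot > T_cold > 0):
        raise ValueError("expected absolute temperatures with T_hot > T_cold > 0")
    return 1.0 - T_cold / T_hot

eta_carnot = carnot_efficiency(T_hot=500.0, T_cold=300.0)
print(eta_carnot)
```

Nothing about the working substance enters the function, which is the content of the Carnot theorem.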

Let us consider an irreversible machine operating with high
temperature and low temperature reservoirs similar to those of
a reversible machine operating according to the Carnot cycle.
Assume that upon completion of the cycle the machine returns to its
initial state, which we shall consider an equilibrium one. Since the
entropy is a function of state, its increment during the cycle should
equal zero:

$$\oint dS = 0$$

Since the processes which the cycle consists of are irreversible,
for each elementary process the inequality dS > d'Q/T holds [see
expression (11.111)]. Hence, from the condition that the total
entropy increment during the cycle equals zero, it follows that

$$0 = \oint dS > \oint \frac{d'Q}{T}$$

whence

$$\oint \frac{d'Q}{T} < 0 \qquad (12.14)$$

Let us divide the integral in (12.14) into four addends:

$$\oint \frac{d'Q}{T} = \int_{T_1} \frac{d'Q}{T} + \int_{Ad1} \frac{d'Q}{T} + \int_{T_2} \frac{d'Q}{T} + \int_{Ad2} \frac{d'Q}{T} < 0 \qquad (12.15)$$

The first addend corresponds to the process of obtaining the heat $Q_1$
(this amount of heat does not necessarily coincide with the heat $Q_1$
which a reversible machine receives during a cycle) from the
reservoir with the temperature $T_1$. The second addend in (12.15)
corresponds to the first adiabatic path of the cycle. The third addend
corresponds to the process of transferring the heat $Q'_2$ (this amount of
heat does not necessarily coincide with the heat $Q'_2$ which a
reversible machine gives up during a cycle) to the reservoir with the
temperature $T_2$. Finally, the fourth addend in (12.15) corresponds
to the second adiabatic path of the cycle. On the adiabatic paths,
d'Q = 0, therefore the corresponding integrals vanish. The integral
corresponding to the path $T_1$ equals $Q_1/T_1$ (we remind our reader
that for an irreversible process, the denominator of the ratio d'Q/T
contains the temperature of the reservoir from which the given
substance receives the heat d'Q). The integral corresponding to the path
$T_2$ equals $-Q'_2/T_2$. We thus arrive at the inequality

$$\frac{Q_1}{T_1} - \frac{Q'_2}{T_2} < 0 \qquad (12.16)$$

We find from (12.16) that

$$\frac{Q'_2}{Q_1} > \frac{T_2}{T_1}$$

and, consequently,

$$\eta = 1 - \frac{Q'_2}{Q_1} < 1 - \frac{T_2}{T_1} = \frac{T_1 - T_2}{T_1} \qquad (12.17)$$

[Fig. 12.4]

The result obtained signifies that the efficiency of an irreversible
machine is always smaller than that of a reversible one operating
in the same conditions.
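The sign test contained in Eqs. (12.10) and (12.16) can be written as a short routine. The Python sketch below classifies a two-reservoir cycle from the heats exchanged and the reservoir temperatures. The function names, the tolerance and the sample numbers are assumptions made for illustration.

```python
def clausius_sum(q1, t1, q2_out, t2):
    """Q1/T1 - Q2'/T2 for a cycle working between two reservoirs."""
    return q1 / t1 - q2_out / t2

def classify_cycle(q1, t1, q2_out, t2, tol=1e-12):
    """Zero sum: reversible; negative: irreversible; positive: forbidden."""
    s = clausius_sum(q1, t1, q2_out, t2)
    if abs(s) <= tol:
        return "reversible"
    return "irreversible" if s < 0 else "impossible"

print(classify_cycle(1000.0, 500.0, 600.0, 300.0))  # heats in the ratio T2/T1
print(classify_cycle(1000.0, 500.0, 700.0, 300.0))  # more heat is rejected
print(classify_cycle(1000.0, 500.0, 500.0, 300.0))  # would beat the Carnot limit
```

In the second case the efficiency is 0.3, below the value 0.4 of a reversible machine working between the same reservoirs.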

The form of a Carnot cycle in a p-V diagram depends on the prop- 
erties of the substance performing the cycle. A cycle for an ideal 
gas is shown in Fig. 12.4. The efficiency of a Carnot cycle for an 
ideal gas can be calculated without resorting to finding the entropy 
increment. 

The internal energy of an ideal gas remains constant in an
isothermal process. Therefore, the heat $Q_1$ received by the gas equals
the work $A_{12}$ done by the gas upon transition from state 1 to state 2
(Fig. 12.4). This work, according to Eq. (10.60), is

$$Q_1 = A_{12} = \frac{m}{M} R T_1 \ln\frac{V_2}{V_1} \qquad (12.18)$$

where m is the mass of the ideal gas in the machine. The heat $Q'_2$
given up to the low temperature reservoir equals the work $A'_{34}$ done
to compress the gas from state 3 to state 4. This work is

$$Q'_2 = A'_{34} = \frac{m}{M} R T_2 \ln\frac{V_3}{V_4} \qquad (12.19)$$
For the cycle to be closed, states 1 and 4 must be on the same
adiabat. Hence the condition follows that

$$T_1 V_1^{\gamma - 1} = T_2 V_4^{\gamma - 1} \qquad (12.20)$$

[see adiabat equation (10.41)]. Similarly, since states 2 and 3 are
on the same adiabat, the condition is observed that

$$T_1 V_2^{\gamma - 1} = T_2 V_3^{\gamma - 1} \qquad (12.21)$$

By dividing Eq. (12.21) by (12.20), we arrive at the condition for
the cycle to be closed:

$$\frac{V_2}{V_1} = \frac{V_3}{V_4} \qquad (12.22)$$

Now let us introduce Eqs. (12.18) and (12.19) into Eq. (12.5)
for the efficiency:

$$\eta = \frac{\frac{m}{M} R T_1 \ln\frac{V_2}{V_1} - \frac{m}{M} R T_2 \ln\frac{V_3}{V_4}}{\frac{m}{M} R T_1 \ln\frac{V_2}{V_1}}$$

Finally, taking condition (12.22) into account, we get

$$\eta = \frac{T_1 - T_2}{T_1}$$

which coincides with Eq. (12.13).
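The calculation for an ideal gas can be repeated numerically. The Python sketch below builds the four states of the cycle from the adiabat conditions (12.20) and (12.21), evaluates the heats by Eqs. (12.18) and (12.19), and compares the efficiency with 1 - T2/T1. The function name, the choice of one mole of a monatomic gas and the sample volumes are assumptions made for illustration.

```python
import math

R = 8.314  # molar gas constant, J/(mol K)

def carnot_cycle_ideal_gas(T1, T2, V1, V2, gamma, moles=1.0):
    """Return (Q1, Q2_out, efficiency) for a Carnot cycle of an ideal gas."""
    stretch = (T1 / T2) ** (1.0 / (gamma - 1.0))
    V4 = V1 * stretch   # state 4 lies on the adiabat through state 1, Eq. (12.20)
    V3 = V2 * stretch   # state 3 lies on the adiabat through state 2, Eq. (12.21)
    Q1 = moles * R * T1 * math.log(V2 / V1)       # Eq. (12.18), moles = m/M
    Q2_out = moles * R * T2 * math.log(V3 / V4)   # Eq. (12.19)
    return Q1, Q2_out, (Q1 - Q2_out) / Q1

Q1, Q2_out, eta = carnot_cycle_ideal_gas(
    T1=500.0, T2=300.0, V1=1.0e-3, V2=2.0e-3, gamma=5.0 / 3.0
)
print(eta, 1.0 - 300.0 / 500.0)  # the two values coincide
```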


12.3. The Thermodynamic Temperature Scale


The theorem on the efficiency of reversible machines not depending
on the properties of the working substance proved in the preceding
section makes it possible to establish a temperature scale that does
not depend on the choice of the thermometric body. In accordance
with this theorem, the quantity

$$\eta = \frac{Q_1 - Q'_2}{Q_1}$$

and, consequently, the ratio $Q'_2/Q_1$ for a Carnot cycle depend only
on the temperatures of the high temperature and low temperature
reservoirs. Denoting the values of these temperatures according to
a scale that we meanwhile do not know by $\vartheta_1$ and $\vartheta_2$,
we can write that

$$\frac{Q'_2}{Q_1} = f(\vartheta_1, \vartheta_2) \qquad (12.23)$$

where $f(\vartheta_1, \vartheta_2)$ is a universal (i.e. identical for all
Carnot cycles) function of the temperatures of the high temperature and
low temperature reservoirs. Equation (12.23) permits us to determine the
temperature of bodies through the amounts of heat received and given up
in Carnot cycles.

We shall prove that function (12.23) has the following property:

$$f(\vartheta_1, \vartheta_2) = \frac{\Theta(\vartheta_2)}{\Theta(\vartheta_1)} \qquad (12.24)$$

where $\Theta(\vartheta)$ is again a universal function of the
temperature. Let us consider two reversible machines $M_1$ and $M_2$
(Fig. 12.5), the cooler (low temperature reservoir) of one of them
simultaneously being the heater (high temperature reservoir) of
the other. Let the second machine take the same amount of heat from
the reservoir at the temperature $\vartheta_1$ that the first machine
supplies to it.

For machine $M_1$, we have $Q_1 = Q_{\mathrm{I}}$ and $Q'_2 = Q_{\mathrm{II}}$.
Hence, Eq. (12.23) for this machine has the form

$$\frac{Q_{\mathrm{II}}}{Q_{\mathrm{I}}} = f(\vartheta_3, \vartheta_1) \qquad (12.25)$$

For machine $M_2$, we have $Q_1 = Q_{\mathrm{II}}$, $Q'_2 = Q_{\mathrm{III}}$.
Hence, by Eq. (12.23),

$$\frac{Q_{\mathrm{III}}}{Q_{\mathrm{II}}} = f(\vartheta_1, \vartheta_2) \qquad (12.26)$$

Considering machines $M_1$ and $M_2$, and also the reservoir with the
temperature $\vartheta_1$, as a single reversible machine that receives the
heat $Q_{\mathrm{I}}$ from the heater with the temperature $\vartheta_3$ and
gives up the heat $Q_{\mathrm{III}}$ to the cooler with the temperature
$\vartheta_2$, we can write:

$$\frac{Q_{\mathrm{III}}}{Q_{\mathrm{I}}} = f(\vartheta_3, \vartheta_2) \qquad (12.27)$$

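Dividing Eq. (12.27) by (12.25), we find that

$$\frac{Q_{\mathrm{III}}}{Q_{\mathrm{II}}} = \frac{f(\vartheta_3, \vartheta_2)}{f(\vartheta_3, \vartheta_1)}$$

A comparison of this equation with Eq. (12.26) gives us the expression

$$f(\vartheta_1, \vartheta_2) = \frac{f(\vartheta_3, \vartheta_2)}{f(\vartheta_3, \vartheta_1)} \qquad (12.28)$$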

This equation relates the temperatures $\vartheta_1$ and $\vartheta_2$ of
two substances, but the temperature $\vartheta_3$ of a third substance
figures in it. If we agree once and forever on the selection of this
substance, i.e. make $\vartheta_3$ constant, we shall reduce the function
$f(\vartheta_3, \vartheta)$ in the numerator and denominator of Eq. (12.28)
to a function of a single variable $\vartheta$. Denoting this function by
$\Theta(\vartheta)$, we arrive at Eq. (12.24).

The function $\Theta(\vartheta)$ depends only on the temperature. Therefore
its value can be used to characterize the temperature of the
corresponding substance, i.e. we can assume that the temperature of the
substance is $\Theta$, where $\Theta = \Theta(\vartheta)$. Equation (12.23)
thus becomes

$$\frac{Q'_2}{Q_1} = \frac{\Theta_2}{\Theta_1} \qquad (12.29)$$


Equation (12.29) is the foundation of the so-called thermodynamic 
temperature scale. A merit of this scale is that it does not depend 
on the choice of the substance (the working substance in a Carnot 
cycle) used for measuring the temperature. 

In accordance with Eq. (12.29), to compare the temperatures of
two bodies, we must carry out a Carnot cycle using these bodies as
the high temperature and low temperature reservoirs. The ratio
of the amount of heat given up to the “low temperature reservoir”
body to the amount of heat removed from the “high temperature
reservoir” body gives the ratio of the temperatures of the two bodies.
To uniquely determine the numerical value of $\Theta$, we must come to
an agreement on the choice of the temperature unit, i.e. the degree.
The absolute degree is defined as one-hundredth of the difference
between the temperature of water boiling at atmospheric pressure
and that of melting ice. Thus, the degree of the absolute thermodynamic
scale equals the degree of the ideal gas scale.

It is easy to see that the thermodynamic temperature scale
coincides with the ideal gas scale. Indeed, according to Eq. (12.11), we
have

$$\frac{Q'_2}{Q_1} = \frac{T_2}{T_1} \qquad (12.30)$$

Comparing Eqs. (12.29) and (12.30), we find that

$$\frac{\Theta_2}{\Theta_1} = \frac{T_2}{T_1}$$

Hence, $\Theta$ is proportional to T, and since the degree of both scales
is the same, $\Theta = T$.
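The way Eq. (12.29) turns measured heats into a temperature can be shown with a minimal Python sketch. It assumes that the temperature of the high temperature reservoir is already known on the thermodynamic scale and that the heats come from a reversible (Carnot) cycle. The function name and the sample values are hypothetical.

```python
def thermodynamic_temperature(theta_hot, q_received, q_given_up):
    """Temperature of the cold body from Eq. (12.29): Theta_2 = Theta_1 * Q2'/Q1."""
    if q_received <= 0 or q_given_up <= 0:
        raise ValueError("both amounts of heat must be positive")
    return theta_hot * q_given_up / q_received

theta_cold = thermodynamic_temperature(theta_hot=400.0, q_received=1000.0, q_given_up=750.0)
print(theta_cold)  # in the same units as theta_hot
```

No property of the working substance appears in the calculation, which is the merit of the scale noted above.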


12.4. Examples of Calculating the Entropy

The entropy is a function of state. It must therefore depend on
parameters determining the state of a system. For example, it can
be represented as a function of p and T, or as a function of V and T,
and so on. Let us assume that a body is heated at the constant
pressure p from absolute zero to the temperature T, and that the
heating process is reversible. Hence, according to Eqs. (11.110) and
(10.24), the entropy of a body at the pressure p and temperature T
is determined by the expression

$$S(p, T) = \int_0^T \frac{C_p(T)\,dT}{T} \qquad (12.31)$$


where $C_p(T)$ is the heat capacity of the body at constant pressure,
which is a function of temperature. Similarly, the entropy as a
function of the volume V and temperature T can be represented in the
form

$$S(V, T) = \int_0^T \frac{C_V(T)\,dT}{T} \qquad (12.32)$$

where $C_V$ is the heat capacity of the body at constant volume.

It can be seen from Eqs. (12.31) and (12.32) that the heat
capacities $C_p$ and $C_V$ (and also the heat capacity in any other process)
vanish at absolute zero. Indeed, if the heat capacity did not tend
to zero, then the integrand at T tending to zero would grow without
restriction, as a result of which the integral would diverge
(i.e. would become infinite).
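The divergence argument can be made concrete by integrating from a small lower limit instead of from zero. The Python sketch below compares a constant heat capacity with one that vanishes as the cube of the temperature. The cubic law is only an assumed example of a heat capacity tending to zero, and the function names are illustrative.

```python
import math

def entropy_constant_capacity(C, T, eps):
    """Integral of C dT/T from eps to T for a constant heat capacity C."""
    return C * math.log(T / eps)

def entropy_cubic_capacity(a, T, eps):
    """Integral of (a T^3) dT/T from eps to T, an assumed law with C -> 0."""
    return a * (T**3 - eps**3) / 3.0

for eps in (1e-3, 1e-6, 1e-9):
    print(eps, entropy_constant_capacity(1.0, 1.0, eps), entropy_cubic_capacity(1.0, 1.0, eps))
```

As the lower limit shrinks, the first column of results grows without bound, whereas the second settles at a finite value.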

1. Entropy of an Ideal Gas. In Sec. 11.11, we found an expression
for the entropy of a monatomic ideal gas (i.e. a gas for which
$C_V = \frac{3}{2}R$). Now, using Eq. (11.110), we shall get an expression for
the entropy of an ideal gas with any molecules. Since the entropy
is additive, it is sufficient to find its value for a mole of the gas
$S_m$. The entropy of an arbitrary amount of the gas will be
$S = (m/M)\,S_m$.

We shall characterize the state of a substance by the parameters
V and T, but shall not consider that the process being studied is
isochoric. According to Nernst’s theorem and Eq. (11.110), we have

$$S_m(V, T) = \int_0^{(V, T)} \frac{d'Q}{T} \qquad (12.33)$$

where the symbol (V, T) designates the state of the gas (we have in
mind the volume of a mole). Integration is performed over an
arbitrary reversible process transferring the substance from its state at
absolute zero to the state characterized by the volume V and the
temperature T.

Let us take the volume $V_0$ and the temperature $T_0$ at which the
substance is sure to be an ideal gas, and divide the integral in
Eq. (12.33) into two:

$$S_m(V, T) = \int_0^{(V_0, T_0)} \frac{d'Q}{T} + \int_{(V_0, T_0)}^{(V, T)} \frac{d'Q}{T} \qquad (12.34)$$

The first integral is a number that we shall denote by $S(V_0, T_0)$.
The second integral is a function of V and T. To find the form of this
function, let us write d'Q as $d'Q = C_V\,dT + p\,dV$ (in the
integration interval the substance behaves like an ideal gas). Dividing
d'Q by T and substituting R/V for p/T in accordance with the
equation of state, we get

$$\int_{(V_0, T_0)}^{(V, T)} \frac{d'Q}{T} = \int_{T_0}^{T} \frac{C_V\,dT}{T} + \int_{V_0}^{V} \frac{R\,dV}{V} = C_V \ln\frac{T}{T_0} + R \ln\frac{V}{V_0}$$

Thus, Eq. (12.34) becomes 

5 m (V, T) = S (V 0 , To) + C v In ~~ + R In — (12.35) 

We can transform this equation as follows*: 

S m = C v In T + R In V + S 0 (12.36) 


where S 0 is a constant equal to S (F 0 , T 0 ) — In T 0 — In Vo- 
It must be noted that the relations with which we have to deal
in practice usually include either derivatives of the entropy with
respect to parameters of state or a change in the entropy. In these
cases, there is no need to find the value of the additive constant in
the expression for the entropy.
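
The statement that the integration may follow any reversible process
can be checked numerically. The sketch below is only an illustration
under stated assumptions: a monatomic ideal gas with constant $C_V$,
and arbitrarily chosen end states. It integrates
$d'Q/T = C_V\,dT/T + R\,dV/V$ along three different paths in the
$(V, T)$ plane and compares the results with the closed-form change
$C_V \ln (T/T_0) + R \ln (V/V_0)$.

```python
import math

R = 8.314        # molar gas constant, J/(mol*K)
C_V = 1.5 * R    # assumption: monatomic ideal gas with constant C_V


def entropy_integral(path, steps=20000):
    """Integrate d'Q/T = C_V dT/T + R dV/V along straight segments
    joining the (V, T) states listed in `path` (midpoint rule)."""
    total = 0.0
    for (V_a, T_a), (V_b, T_b) in zip(path, path[1:]):
        dV = (V_b - V_a) / steps
        dT = (T_b - T_a) / steps
        for i in range(steps):
            V = V_a + (i + 0.5) * dV
            T = T_a + (i + 0.5) * dT
            total += C_V * dT / T + R * dV / V
    return total


# Illustrative end states (V in m^3/mol, T in K); not taken from the text.
start = (0.020, 300.0)
end = (0.050, 450.0)

# Path 1: isochoric heating, then isothermal expansion.
heat_then_expand = entropy_integral([start, (start[0], end[1]), end])
# Path 2: isothermal expansion, then isochoric heating.
expand_then_heat = entropy_integral([start, (end[0], start[1]), end])
# Path 3: a straight line in the (V, T) plane.
straight_line = entropy_integral([start, end])

closed_form = (C_V * math.log(end[1] / start[1])
               + R * math.log(end[0] / start[0]))

print(heat_then_expand, expand_then_heat, straight_line, closed_form)
```

All four numbers should agree to within the integration error, which
is what one expects of a function of state.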

Equation (12.36) gives an expression for the entropy of a mole of
an ideal gas in the variables $V$ and $T$. We can use the equation of
state to pass over to expressions for the entropy in other variables.
Using $V = RT/p$ in Eq. (12.36), we get

$$S_m = C_V \ln T + R \ln R + R \ln T - R \ln p + S_0$$

Taking into account that for an ideal gas $C_V + R = C_p$, we can
write

$$S_m = C_p \ln T - R \ln p + S_0' \tag{12.37}$$

where $S_0' = S_0 + R \ln R$.

Finally, substituting $pV/R$ for $T$ in Eq. (12.36), we arrive at the
equation

$$S_m = C_V \ln p + C_p \ln V + S_0'' \tag{12.38}$$

where $S_0'' = S_0 - C_V \ln R$.
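
Since the three expressions differ only in the choice of variables,
they must give the same entropy change between two given states. The
following sketch checks this for one mole of an ideal gas; the value
$C_V = \frac{5}{2}R$ and the two states are assumptions chosen only
for illustration.

```python
import math

R = 8.314         # molar gas constant, J/(mol*K)
C_V = 2.5 * R     # assumption: diatomic gas with rigid molecules
C_p = C_V + R


def pressure(T, V):
    """Ideal gas equation of state for one mole."""
    return R * T / V


def dS_in_VT(T1, V1, T2, V2):
    """Entropy change from Eq. (12.36); the constant S_0 cancels."""
    return C_V * math.log(T2 / T1) + R * math.log(V2 / V1)


def dS_in_Tp(T1, p1, T2, p2):
    """Entropy change from Eq. (12.37)."""
    return C_p * math.log(T2 / T1) - R * math.log(p2 / p1)


def dS_in_pV(p1, V1, p2, V2):
    """Entropy change from Eq. (12.38)."""
    return C_V * math.log(p2 / p1) + C_p * math.log(V2 / V1)


# Two illustrative states (T in K, V in m^3/mol).
T1, V1 = 300.0, 0.0248
T2, V2 = 400.0, 0.0100
p1, p2 = pressure(T1, V1), pressure(T2, V2)

via_VT = dS_in_VT(T1, V1, T2, V2)
via_Tp = dS_in_Tp(T1, p1, T2, p2)
via_pV = dS_in_pV(p1, V1, p2, V2)
print(via_VT, via_Tp, via_pV)
```

Only differences of entropy appear here, so the additive constants
never have to be evaluated.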

* One must not be confused by seeing a dimensional quantity inside
a logarithm. Expressions containing $\ln f$ always have an addend
including $\ln f_0$ ($f_0$ is a constant) with a sign such that
$\ln f$ and $\ln f_0$ can be combined into a single addend of the kind
$\ln (f/f_0)$.

2. Entropy of Water. The changes in the heat capacity of water
within the interval from 0 to 100 °C do not exceed 1%. Therefore,
within this temperature interval, the specific heat capacity of water
may be considered constant and equal to $c = 4.2$ kJ/(kg·K).
Accordingly, denoting by $s(273)$ the specific entropy of liquid water
at 0 °C and by $s(T)$ the specific entropy of water at the
temperature $T$ (here $273 < T < 373$), we can write that

$$s(T) - s(273) = \int_{273}^{T} \frac{c\,dT}{T} = c \ln\frac{T}{273}$$

whence

$$s(T) = c \ln T + \{ s(273) - c \ln 273 \} = c \ln T + \text{const} \tag{12.39}$$
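
As a quick numerical illustration of this result, the sketch below
evaluates the increase in the specific entropy of water heated from
0 to 100 °C, using the constant value $c = 4.2$ kJ/(kg·K) quoted in
the text. The function name is hypothetical.

```python
import math

c = 4.2  # specific heat capacity of water, kJ/(kg*K), value from the text


def specific_entropy_change(T1, T2, heat_capacity=c):
    """s(T2) - s(T1) = c ln(T2/T1) for a constant specific heat capacity."""
    return heat_capacity * math.log(T2 / T1)


delta_s = specific_entropy_change(273.0, 373.0)
print(round(delta_s, 2))  # 1.31 kJ/(kg*K)
```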


3. Change in Entropy in Melting. If the pressure does not change,
then melting proceeds at a constant temperature. Accordingly, the
increment of the specific entropy is

$$\Delta s = \frac{L_f}{T_f} \tag{12.40}$$

where $L_f$ is the specific heat of fusion and $T_f$ is the melting
point. When a substance solidifies, its specific entropy diminishes
by the same amount.

The formula for the increment of the specific entropy upon evap- 
oration differs from Eq. (12.40) only in that it includes the heat 
of vaporization and the boiling point instead of the heat of fusion 
and the melting point. 
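
A minimal sketch of both formulas for water follows. The latent heats
used here (about 334 kJ/kg for fusion and about 2260 kJ/kg for
vaporization at atmospheric pressure) are typical handbook figures
assumed for illustration; they are not given in the text.

```python
def entropy_of_transition(latent_heat, temperature):
    """Increment of specific entropy in a phase transition at constant
    temperature: latent heat divided by the transition temperature."""
    return latent_heat / temperature


# Assumed handbook values for water, kJ/kg (not from the text).
L_FUSION = 334.0
L_VAPORIZATION = 2260.0

ds_melting = entropy_of_transition(L_FUSION, 273.0)          # kJ/(kg*K)
ds_boiling = entropy_of_transition(L_VAPORIZATION, 373.0)    # kJ/(kg*K)
print(ds_melting, ds_boiling)
```

With these inputs the entropy increment in boiling comes out several
times larger than in melting.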


12.5. Some Applications of Entropy 

Let us take the volume $V$ and the temperature $T$ as the independent
parameters characterizing the state of a substance. Hence, the
internal energy of the substance will be a function of these
parameters: $U = U(V, T)$. In this case, the expression of the first
law of thermodynamics has the form*

$$d'Q = \left(\frac{\partial U}{\partial T}\right)_V dT + \left(\frac{\partial U}{\partial V}\right)_T dV + p\,dV \tag{12.41}$$


* The total differential of the function $f(x, y)$ of the variables
$x$ and $y$ is determined by the expression

$$df = \frac{\partial f}{\partial x}\,dx + \frac{\partial f}{\partial y}\,dy$$

This expression gives the increment of the function $f(x, y)$ when the
variables $x$ and $y$ receive the increments $dx$ and $dy$ [see
Eq. (3.33)].

It is customary practice in thermodynamics to write partial
derivatives of functions with respect to the state parameters with a
subscript indicating what parameter is assumed to be constant in
differentiation. This is essential because, for example, we can also
consider the partial derivative of $U$ with respect to $T$ provided
that $p = \text{const}$. This derivative is denoted by the symbol
$(\partial U/\partial T)_p$ and, generally speaking, has a different
value than $(\partial U/\partial T)_V$.

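Dividing Eq. (12.41) by $T$, we get the increment of the entropy:

$$dS = \frac{1}{T}\left(\frac{\partial U}{\partial T}\right)_V dT + \frac{1}{T}\left[\left(\frac{\partial U}{\partial V}\right)_T + p\right] dV \tag{12.42}$$

Considering the entropy as a function of the parameters $V$ and $T$,
we can write the increment of the entropy in the form

$$dS = \left(\frac{\partial S}{\partial T}\right)_V dT + \left(\frac{\partial S}{\partial V}\right)_T dV$$

A comparison with Eq. (12.42) shows that

$$\left(\frac{\partial S}{\partial T}\right)_V = \frac{1}{T}\left(\frac{\partial U}{\partial T}\right)_V, \qquad \left(\frac{\partial S}{\partial V}\right)_T = \frac{1}{T}\left[\left(\frac{\partial U}{\partial V}\right)_T + p\right] \tag{12.43}$$

The mixed partial derivatives of a function $f(x, y)$ satisfy the
condition

$$\frac{\partial^2 f}{\partial x\,\partial y} = \frac{\partial^2 f}{\partial y\,\partial x}$$

Accordingly,

$$\frac{\partial}{\partial V}\left(\frac{\partial S}{\partial T}\right)_V = \frac{\partial}{\partial T}\left(\frac{\partial S}{\partial V}\right)_T$$

The introduction of Eqs. (12.43) into this equation yields

$$\frac{\partial}{\partial V}\left\{\frac{1}{T}\left(\frac{\partial U}{\partial T}\right)_V\right\} = \frac{\partial}{\partial T}\left\{\frac{1}{T}\left[\left(\frac{\partial U}{\partial V}\right)_T + p\right]\right\}$$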

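After differentiation, we get

$$\frac{1}{T}\frac{\partial^2 U}{\partial V\,\partial T} = -\frac{1}{T^2}\left[\left(\frac{\partial U}{\partial V}\right)_T + p\right] + \frac{1}{T}\left[\frac{\partial^2 U}{\partial T\,\partial V} + \left(\frac{\partial p}{\partial T}\right)_V\right]$$

Taking into account that
$\partial^2 U/\partial V\,\partial T = \partial^2 U/\partial T\,\partial V$,
we arrive at the formula

$$\left(\frac{\partial U}{\partial V}\right)_T = T\left(\frac{\partial p}{\partial T}\right)_V - p \tag{12.44}$$

The latter shows how the internal energy depends on the volume.
Let us use it to find the internal energy of an ideal and a van der
Waals gas.

For an ideal gas, we have $p = RT/V$. Hence,
$(\partial p/\partial T)_V = R/V$. Using this value in Eq. (12.44),
we obtain

$$\left(\frac{\partial U}{\partial V}\right)_T = T\,\frac{R}{V} - p = 0$$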

The result obtained signifies that the internal energy of an ideal 
gas does not depend on its volume. In Sec. 10.9, we arrived at the 
same conclusion when we assumed that there is no interaction be- 
tween molecules. 

It follows from the equation of state for a van der Waals gas [see
Eq. (10.62)] that

$$p = \frac{RT}{V - b} - \frac{a}{V^2} \tag{12.45}$$

Hence

$$\left(\frac{\partial p}{\partial T}\right)_V = \frac{R}{V - b}$$

Using this expression in formula (12.44), we get

$$\left(\frac{\partial U}{\partial V}\right)_T = \frac{RT}{V - b} - p = \frac{a}{V^2}$$

[see Eq. (12.45)]. Integration with respect to $V$ yields

$$U = -\frac{a}{V} + f(T)$$

The function $f(T)$ can be determined by taking advantage of the
fact that as $V$ tends to infinity the expression for the internal
energy of a van der Waals gas must transform into the expression
for the internal energy of an ideal gas, $U = C_V T$. As a result, we
arrive at the expression $U = C_V T - a/V$, which we obtained in
Sec. 10.13 on the basis of other considerations [see Eq. (10.66)].
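
The relation between the volume dependence of $U$ and the equation of
state can also be checked numerically. The sketch below does this for
a van der Waals gas by finite differences. The constants $a$ and $b$
are assumed values of the order of those for carbon dioxide, and
$C_V = 3R$ is an arbitrary illustrative choice; none of these numbers
comes from the text.

```python
R = 8.314       # molar gas constant, J/(mol*K)
a = 0.364       # Pa*m^6/mol^2, assumed (order of magnitude of CO2)
b = 4.27e-5     # m^3/mol, assumed
C_V = 3.0 * R   # illustrative constant molar heat capacity


def pressure(V, T):
    """Van der Waals equation of state for one mole."""
    return R * T / (V - b) - a / V**2


def internal_energy(V, T):
    """Internal energy of one mole of a van der Waals gas, U = C_V T - a/V."""
    return C_V * T - a / V


V, T = 1.0e-3, 300.0    # state at which the check is made
hV, hT = 1.0e-7, 1.0e-3  # finite-difference steps

# Left-hand side: (dU/dV) at constant T, by central difference.
dU_dV = (internal_energy(V + hV, T) - internal_energy(V - hV, T)) / (2 * hV)

# Right-hand side: T (dp/dT)_V - p, with dp/dT by central difference.
dp_dT = (pressure(V, T + hT) - pressure(V, T - hT)) / (2 * hT)
rhs = T * dp_dT - pressure(V, T)

print(dU_dV, rhs, a / V**2)
```

All three printed numbers should coincide to within the accuracy of
the finite differences. Setting `a = 0` and `b = 0` reproduces the
ideal gas case, for which the derivative vanishes.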


12.6. Thermodynamic Potentials 

All calculations in thermodynamics are based on the use of func- 
tions of state called thermodynamic potentials. A separate ther- 
modynamic potential corresponds to each set of independent para- 
meters. The changes in the potentials occurring in the course of 
processes determine either the work done by the system or the heat 
received by it. 

In considering thermodynamic potentials, we shall use expression
(11.112), writing it in the form

$$T\,dS \geq d'Q \tag{12.46}$$

The equality sign relates to reversible processes, the inequality
sign to irreversible ones.

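The thermodynamic potentials are functions of state. Therefore,
the increment of any of the potentials equals the total differential
of the function by which it is expressed. The total differential of
the function $f(x, y)$ of the variables $x$ and $y$ is determined by
the expression

$$df = \left(\frac{\partial f}{\partial x}\right)_y dx + \left(\frac{\partial f}{\partial y}\right)_x dy$$

Therefore, if in the course of transformations we get for the
increment of a quantity $f$ an expression of the kind

$$df = X(\xi, \eta)\,d\xi + Y(\xi, \eta)\,d\eta \tag{12.47}$$

then we can state that this quantity is a function of the parameters
$\xi$ and $\eta$, the functions $X(\xi, \eta)$ and $Y(\xi, \eta)$
being the partial derivatives of the function $f(\xi, \eta)$:

$$\left(\frac{\partial f}{\partial \xi}\right)_\eta = X(\xi, \eta), \qquad \left(\frac{\partial f}{\partial \eta}\right)_\xi = Y(\xi, \eta) \tag{12.48}$$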

Internal Energy. We are already well acquainted with one
of the thermodynamic potentials, namely, the internal energy of
a system. The expression of the first law for a reversible process can
be written in the form

$$dU = T\,dS - p\,dV \tag{12.49}$$

A comparison with Eq. (12.47) shows that the variables $S$ and $V$
play the role of the so-called natural variables for the potential
$U$. It follows from Eq. (12.48) that

$$\left(\frac{\partial U}{\partial S}\right)_V = T, \qquad \left(\frac{\partial U}{\partial V}\right)_S = -p \tag{12.50}$$

It can be seen from the relation $d'Q = dU + d'A$ that when
a body does not exchange heat with its surroundings, the work done
by it is

$$d'A = -dU$$

or in the integral form

$$A = U_1 - U_2 \quad \text{(heat exchange is absent)} \tag{12.51}$$

Thus, in the absence of heat exchange with the surroundings, the
work equals the decrement of the internal energy of a body.

At constant volume

$$d'Q = dU \tag{12.52}$$

Hence, the heat capacity at constant volume is

$$C_V = \left(\frac{\partial U}{\partial T}\right)_V \tag{12.53}$$

Free Energy. According to Eq. (12.49), the work done by a body
in a reversible isothermal process can be written in the form

$$d'A = -dU + T\,dS = -d(U - TS) \tag{12.54}$$

The function of state

$$F = U - TS \tag{12.55}$$

is called the free (Helmholtz) energy of a body.

According to Eqs. (12.54) and (12.55), in a reversible isothermal
process, the work equals the decrement of the free energy of a body:

$$d'A = -dF \tag{12.56}$$

or

$$A = F_1 - F_2 \quad (T = \text{const, reversible}) \tag{12.57}$$


A comparison with Eq. (12.51) shows that in isothermal processes 
the free energy plays the same part as the internal energy in adia- 
batic ones. 

We must note that Eq. (12.51) holds for both reversible and
irreversible processes. Equation (12.57), on the contrary, holds only
for reversible processes. For irreversible processes, we have
$d'Q < T\,dS$ [see expression (12.46)]. Introducing this inequality
into the equation $d'A = d'Q - dU$, it is easy to find that in
irreversible isothermal processes we have

$$A < F_1 - F_2 \quad (T = \text{const, irreversible}) \tag{12.58}$$

Hence, the decrement of the free energy determines the upper limit 
of the amount of work that a system can do in an isothermal proc- 
ess. 
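
The upper limit can be illustrated with one mole of an ideal gas that
doubles its volume at constant temperature. For an ideal gas, $U$
does not change in an isothermal process, so that
$F_1 - F_2 = T (S_2 - S_1) = RT \ln (V_2/V_1)$ by Eq. (12.36). The
sketch below compares this decrement with the work done in a
reversible expansion and in two irreversible ones. The numerical
values and the function names are assumptions made for illustration.

```python
import math

R = 8.314  # molar gas constant, J/(mol*K)


def free_energy_decrement(T, V1, V2):
    """F1 - F2 for one mole of an ideal gas at constant temperature."""
    return R * T * math.log(V2 / V1)


def work_reversible(T, V1, V2, steps=100000):
    """Work in a reversible isothermal expansion: integral of p dV
    with p = RT/V, evaluated by the midpoint rule."""
    dV = (V2 - V1) / steps
    return sum(R * T / (V1 + (i + 0.5) * dV) * dV for i in range(steps))


def work_against_constant_pressure(T, V1, V2):
    """Work in an irreversible expansion against a constant external
    pressure equal to the final gas pressure RT/V2."""
    return R * T / V2 * (V2 - V1)


T, V1, V2 = 300.0, 0.010, 0.020   # illustrative values
upper_limit = free_energy_decrement(T, V1, V2)
A_reversible = work_reversible(T, V1, V2)
A_constant_pressure = work_against_constant_pressure(T, V1, V2)
A_free_expansion = 0.0            # expansion into a vacuum does no work

print(upper_limit, A_reversible, A_constant_pressure, A_free_expansion)
```

The reversible work reproduces the decrement of the free energy,
whereas both irreversible processes between the same two states yield
less work.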

Let us take a differential of the function (12.55). Taking Eq. (12.49)
into account, we get

$$dF = T\,dS - p\,dV - T\,dS - S\,dT = -S\,dT - p\,dV \tag{12.59}$$

We conclude from a comparison with Eq. (12.47) that $T$ and $V$ are
the natural variables for the free energy. According to Eq. (12.48)

$$\left(\frac{\partial F}{\partial T}\right)_V = -S, \qquad \left(\frac{\partial F}{\partial V}\right)_T = -p \tag{12.60}$$


Let us substitute $dU + p\,dV$ for $d'Q$ in expression (12.46) and
divide the resulting relation by $dt$ (here $t$ is the time). The
result is

$$T\,\frac{dS}{dt} \geq \frac{dU}{dt} + p\,\frac{dV}{dt} \tag{12.61}$$

If the temperature and the volume remain constant, then expression
(12.61) can be transformed as follows:

$$\frac{d(U - TS)}{dt} = \frac{dF}{dt} \leq 0 \quad (T = \text{const},\ V = \text{const}) \tag{12.62}$$

Examination of this expression shows that an irreversible process
occurring at constant temperature and volume is attended by a
decrease in the free energy of the body. When equilibrium is reached,
$F$ stops changing with time. Thus, a state for which the free energy
is minimum is the equilibrium one at constant temperature and
volume.

Enthalpy. If a process goes on at constant pressure, then the 
amount of heat received by a body can be represented as follows: 

d'Q = dU + p dV = d (U + pV) (12.63) 

The function of state 

H = U + pV (12.64) 

is called the enthalpy (or heat function). 

It follows from Eqs. (12.63) and (12.64) that the heat received by 
a body in the course of an isobaric process is 

d'Q = dH (12.65) 

or in the integral form

$$Q = H_2 - H_1 \tag{12.66}$$

Hence, when the pressure remains constant, the heat received by
a body equals the increment of the enthalpy.

Differentiation of Eq. (12.64) with account taken of Eq. (12.49)
yields

$$dH = T\,dS - p\,dV + p\,dV + V\,dp = T\,dS + V\,dp \tag{12.67}$$

Thus, we conclude that the enthalpy is a thermodynamic potential
in the variables $S$ and $p$. Its partial derivatives are

$$\left(\frac{\partial H}{\partial S}\right)_p = T, \qquad \left(\frac{\partial H}{\partial p}\right)_S = V \tag{12.68}$$

The heat capacity at constant pressure in accordance with
Eq. (12.65) is

$$C_p = \left(\frac{\partial H}{\partial T}\right)_p \tag{12.69}$$

Comparing Eqs. (12.65) and (12.69) with (12.52) and (12.53), we 
arrive at the conclusion that when the pressure is constant, the 
enthalpy has properties similar to those which the internal energy 
has at a constant volume. 
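
As a short worked illustration of this analogy (a sketch for one mole
of an ideal gas, assuming the internal energy $U = C_V T$ and the
equation of state $pV = RT$), the enthalpy and the heat capacity at
constant pressure follow directly from the definition of $H$:

```latex
\begin{aligned}
H &= U + pV = C_V T + RT = (C_V + R)\,T, \\
C_p &= \left(\frac{\partial H}{\partial T}\right)_p = C_V + R .
\end{aligned}
```

This reproduces the relation $C_V + R = C_p$ for an ideal gas.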

Gibbs Thermodynamic Potential. This function of state (also
called the Gibbs energy) is defined as follows:

$$G = H - TS = U + pV - TS \tag{12.70}$$

Its total differential is [see Eq. (12.67)]

$$dG = T\,dS + V\,dp - T\,dS - S\,dT = V\,dp - S\,dT \tag{12.71}$$

Consequently, $p$ and $T$ are the natural variables for the function
$G$. The partial derivatives of this function are

$$\left(\frac{\partial G}{\partial p}\right)_T = V, \qquad \left(\frac{\partial G}{\partial T}\right)_p = -S \tag{12.72}$$

If the temperature and pressure remain constant, expression
(12.61) can be written in the form

$$\frac{d(U + pV - TS)}{dt} = \frac{dG}{dt} \leq 0 \quad (T = \text{const},\ p = \text{const}) \tag{12.73}$$


It can be seen from this expression that an irreversible process going 
on at constant temperature and pressure is attended by a decrease in 
the Gibbs thermodynamic potential. When equilibrium is reached, 
G stops changing with time. Thus, a state for which the Gibbs 
thermodynamic potential is minimum is the equilibrium one at 
constant temperature and pressure [compare with expression (12.62)]. 

Table 12.1 gives the basic properties of the thermodynamic poten- 
tials. 


Table 12.1

Name and symbol of                Properties
thermodynamic potential
--------------------------------  ---------------------------------------------
Internal energy                   U_1 − U_2 = A   in adiabatic process
U = U(S, V)                       U_2 − U_1 = Q   when V = const

Free energy                       F_1 − F_2 = A   in reversible isothermal
F = F(T, V)                                       process
                                  F = min         for equilibrium state when
                                                  T = const and V = const

Enthalpy                          H_2 − H_1 = Q   when p = const
H = H(S, p)

Gibbs thermodynamic potential     G = min         for equilibrium state when
G = G(T, p)                                       T = const and p = const
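
The way a potential in its natural variables determines the remaining
quantities can be tried out on the free energy of one mole of an ideal
gas. The sketch below builds $F(T, V) = U - TS$ from $U = C_V T$ and
the entropy $S_m = C_V \ln T + R \ln V + S_0$, and then checks by
finite differences that $-(\partial F/\partial T)_V$ returns the
entropy and $-(\partial F/\partial V)_T$ returns the pressure $RT/V$.
The choice $C_V = \frac{3}{2}R$, the value $S_0 = 0$, and the state
used are assumptions made for illustration.

```python
import math

R = 8.314        # molar gas constant, J/(mol*K)
C_V = 1.5 * R    # assumption: monatomic ideal gas
S_0 = 0.0        # arbitrary additive constant of the entropy


def entropy(T, V):
    """Molar entropy of an ideal gas in the variables T and V."""
    return C_V * math.log(T) + R * math.log(V) + S_0


def free_energy(T, V):
    """F = U - TS with U = C_V T."""
    return C_V * T - T * entropy(T, V)


def central_difference(f, x, h):
    """Symmetric finite-difference estimate of df/dx."""
    return (f(x + h) - f(x - h)) / (2 * h)


T, V = 300.0, 0.025   # illustrative state (K, m^3/mol)

dF_dT = central_difference(lambda t: free_energy(t, V), T, 1e-3)
dF_dV = central_difference(lambda v: free_energy(T, v), V, 1e-7)

print(-dF_dT, entropy(T, V))   # the two values should agree
print(-dF_dV, R * T / V)       # the two values should agree
```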













CHAPTER 13 THE CRYSTALLINE STATE


13.1. Features of the Crystalline State 

The majority of solids in nature have a crystalline structure. 
For example, almost all minerals and all metals in the solid state 
are crystals. 

A feature of the crystalline state distinguishing it from the fluid 
states is the presence of anisotropy, i.e. the dependence of a number 
of physical properties (mechanical, thermal, electrical, optical) 
on the direction. 

Bodies whose properties are identical in all directions are called 
isotropic. In addition to gases and, with a few exceptions, all liquids, 
amorphous solids are also isotropic. These solids are supercooled 
liquids (see Sec. 15.6). 

The reason why crystals are anisotropic is the ordered arrangement 
of the particles they are built of (atoms or molecules). The ordered 
arrangement of the particles manifests itself in the regular external 
facetting of crystals. Crystals are restricted by plane facets making 
angles with one another characteristic of a given species of crystals. 
It is easy to split crystals along definite planes called cleavage 
planes. 

The regularity of the geometrical shape and the anisotropy of 
crystals do not usually manifest themselves because crystalline 
bodies are encountered, as a rule, in the form of polycrystals, i.e. 
conglomerates of a multitude of intergrown, randomly oriented 
fine crystals. Anisotropy is observed in polycrystals only within 
the confines of each separately taken minute crystal. A body as 
a whole does not display anisotropy owing to the chaotic orientation 
of its crystals. By providing special conditions of crystallization 
from a melt or a solution, we can obtain large single crystals — 
monocrystals — of any substance. Monocrystals of some minerals are 
encountered in nature. 

The ordered nature of the arrangement of the atoms (or mole- 
cules) of a crystal consists in that they are located at the points (or 
sites) of a geometrically regular space lattice. The entire crystal 
can be obtained by repeating many times in three different directions 
the same structural element called an elementary (or unit) crystal 
cell (Fig. 13.1a). The lengths of the edges a, b, c of a cell are called 
the translation periods of a crystal. 

An elementary cell is a parallelepiped constructed on the three
vectors a, b, c whose magnitudes equal the translation periods. This
parallelepiped, apart from its edges $a$, $b$, $c$, is also
characterized by the angles $\alpha$, $\beta$, $\gamma$ between the
edges (Fig. 13.1b). The quantities $a$, $b$, $c$ and $\alpha$,
$\beta$, $\gamma$ unambiguously define an elementary cell and
are called its parameters.
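
Because the six parameters define the cell completely, any geometric
property of the cell can be computed from them. As an illustration,
the sketch below evaluates the cell volume with the standard formula
for a parallelepiped given its edges and the angles between them. The
formula is not derived in the text, and the function name is
hypothetical.

```python
import math


def cell_volume(a, b, c, alpha, beta, gamma):
    """Volume of a parallelepiped with edges a, b, c and angles
    alpha, beta, gamma (in degrees) between the edges."""
    ca = math.cos(math.radians(alpha))
    cb = math.cos(math.radians(beta))
    cg = math.cos(math.radians(gamma))
    return a * b * c * math.sqrt(1 - ca**2 - cb**2 - cg**2 + 2 * ca * cb * cg)


# A cube with unit edges, and a cell with a = b, one angle of 120 degrees.
print(cell_volume(1, 1, 1, 90, 90, 90))
print(cell_volume(1, 1, 2, 90, 90, 120))
```

For right angles the square root equals unity and the volume reduces
to the product $abc$.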



An elementary cell can be selected in various ways. This is
illustrated in Fig. 13.2 using an example of a plane structure. The
facing of a wall with alternating light and dark triangular tiles can
be obtained by repeating different cells many times in two directions
(see, for example, cells 1, 2, and 3; the arrows show the directions
in which the cells are repeated). Cells 1 and 2 are distinguished by
including the minimum number of structural elements (one light and
one dark tile each). A crystal cell including the smallest number of
atoms characterizing the chemical composition of a crystalline
substance (for example, one oxygen atom and two hydrogen atoms for
an ice crystal) is known as a primitive cell. It is customary
practice, however, to select an elementary cell having a greater
number of atoms, but with the same symmetry as the entire crystal,
instead of a primitive cell. Thus, the plane structure depicted in
Fig. 13.2 coincides with itself when rotated through 120 degrees
about any axis at right angles to it that passes through an apex of
a tile. Elementary cell 3 has the same property. Cells 1 and 2 have
a smaller degree of symmetry: they coincide with themselves only when
rotated through 360 degrees.

Fig. 13.2


13.2. Classification of Crystals

A crystal lattice can have different kinds of symmetry. By the 
symmetry of a crystal lattice is meant its property to coincide with 
itself upon certain displacements in space. 

Every lattice has translation symmetry first of all, i.e. it coin- 
cides with itself upon displacement over the translation period*. 
Among the other kinds of symmetry, we shall note symmetry with 
respect to rotations about certain axes, and also to mirror reflection 
relative to definite planes. 

If a lattice coincides with itself when rotated about an axis
through the angle $2\pi/n$ (consequently, the lattice coincides with
itself $n$ times in one complete revolution about the axis), then this
axis is called an axis of symmetry of the $n$-th order. It can be
shown that apart from the trivial axis of the first order, only axes
of symmetry of the second, third, fourth, and sixth orders are
possible. Examples of structures having such axes of symmetry are
shown schematically in Fig. 13.3 (the white circles, black circles,
and crosses signify atoms of different species).
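
The restriction on the order of the axes can be made plausible with
a short computation. The sketch below relies on a standard argument
that is not given in the text: in a basis of lattice vectors a
symmetry rotation is represented by a matrix of integers, so that its
trace, which equals $1 + 2\cos (2\pi/n)$ for a rotation through
$2\pi/n$, must be an integer. The code simply tests this condition
for the first twelve orders.

```python
import math


def allowed_orders(max_order=12, tol=1e-9):
    """Orders n for which 1 + 2 cos(2 pi / n), the trace of a rotation
    through 2 pi / n, is an integer."""
    allowed = []
    for n in range(1, max_order + 1):
        trace = 1 + 2 * math.cos(2 * math.pi / n)
        if abs(trace - round(trace)) < tol:
            allowed.append(n)
    return allowed


print(allowed_orders())  # [1, 2, 3, 4, 6]
```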

If a lattice coincides with itself when reflected in a certain plane 
as in a mirror, this plane is defined as a plane of symmetry. An 
example of a plane of symmetry is also shown in Fig. 13.3. 

The different kinds of symmetry are called elements of symmetry
of a crystal lattice. There are other elements of symmetry in
addition to axes and planes, but we shall not consider them here.

A crystal lattice, as a rule, has several kinds of symmetry at
a time. Not every combination of the elements of symmetry is
possible, however. The prominent Russian scientist Yevgraf Fedorov
(1853-1919) showed that 230 combinations of the symmetry elements are
possible. These combinations are called space groups. They are
divided according to features of symmetry into 32 classes. Finally,
with respect to the shape of the elementary cell, all crystals are
divided into seven crystallographic systems, each of which includes
several classes of symmetry.

* When considering the symmetry of a lattice, the finite dimensions
of the crystal are disregarded, and the lattice is considered to be
infinite.

The crystallographic systems are arranged in order of increasing
symmetry as follows.

1. Triclinic System. For this system, $a \neq b \neq c$;
$\alpha \neq \beta \neq \gamma$. An elementary cell has the form of
an oblique parallelepiped.

2. Monoclinic System. It has two right angles, while the third
angle (usually the angle $\beta$) is not a right one. Hence,
$a \neq b \neq c$; $\alpha = \gamma = 90^\circ$,
$\beta \neq 90^\circ$. An elementary cell has the form of a right
prism with a parallelogram as its base (i.e. the form of a right
parallelepiped).

Fig. 13.5

3. Rhombic System. All the angles are right ones, all the edges
are different: $a \neq b \neq c$; $\alpha = \beta = \gamma = 90^\circ$.
An elementary cell has the form of a rectangular parallelepiped.

4. Tetragonal System. All the angles are right ones, two edges
are equal: $a = b \neq c$; $\alpha = \beta = \gamma = 90^\circ$. An
elementary cell has the form of a right prism with a square base.

5. Rhombohedral (or Trigonal) System. All the edges are equal,
all the angles are also equal and are other than right ones:
$a = b = c$; $\alpha = \beta = \gamma \neq 90^\circ$. An elementary
cell has the form of a cube deformed by compression or tension along
a diagonal.

6. Hexagonal System. The edges and the angles between them
comply with the conditions $a = b \neq c$;
$\alpha = \beta = 90^\circ$, $\gamma = 120^\circ$. Three elementary
cells brought together as shown in Fig. 13.4 form a regular hexagonal
prism.

7. Cubic System. All the edges are equal, all the angles are right
ones: $a = b = c$; $\alpha = \beta = \gamma = 90^\circ$. An elementary
cell has the form of a cube.
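
The conditions listed for the seven systems can be collected into
a small lookup routine. The sketch below is a hypothetical helper,
not a crystallographic tool: it applies the conditions literally, in
the labelling of edges and angles used above, and it ignores the fact
that the system of a real crystal is fixed by its symmetry rather
than by an accidental equality of measured parameters.

```python
def crystal_system(a, b, c, alpha, beta, gamma, tol=1e-6):
    """Name the crystallographic system from the cell parameters
    (angles in degrees), using the conditions of the list above."""
    def same(x, y):
        return abs(x - y) <= tol

    def right(angle):
        return same(angle, 90.0)

    all_right = right(alpha) and right(beta) and right(gamma)

    if same(a, b) and same(b, c):
        if all_right:
            return "cubic"
        if same(alpha, beta) and same(beta, gamma):
            return "rhombohedral"
    if same(a, b) and right(alpha) and right(beta):
        if right(gamma):
            return "tetragonal"
        if same(gamma, 120.0):
            return "hexagonal"
    if all_right:
        return "rhombic"
    if right(alpha) and right(gamma):
        return "monoclinic"
    return "triclinic"


print(crystal_system(1, 1, 1, 90, 90, 90))    # cubic
print(crystal_system(1, 1, 2, 90, 90, 120))   # hexagonal
print(crystal_system(1, 2, 3, 90, 100, 90))   # monoclinic
```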


13.3. Physical Kinds of Crystal Lattices

Four kinds of crystal lattices and accordingly four kinds of crys- 
tals are distinguished depending on the nature of the particles at 
the lattice points and on the nature of the forces of interaction be- 
tween the particles. They are ionic, atomic, metallic, and molecular 
crystals. 

1. Ionic Crystals. Ions of opposite signs occupy the lattice points.
The forces of interaction between them are mainly electrostatic
(Coulomb). The bond due to the electrostatic forces of attraction
between oppositely charged ions is called a heteropolar (or ionic)
bond.

A typical example of an ionic lattice is that of table salt (NaCl) 
shown in Fig. 13.5. It belongs to the cubic system. The white circles 
depict the positively charged sodium ions, and the black circles the 
negative chloride ions. A glance at the figure shows that the closest 
neighbours of an ion of a given sign will be ones of the opposite 
sign. In the gaseous state, NaCl consists of molecules in which 
sodium ions are combined with chloride ones in pairs. The group of 
an Na ion and a Cl ion forming a molecule loses its isolated existence 
in a crystal. An ionic crystal consists of ions, and not of molecules. 
The entire crystal can be considered as a single giant molecule. 

2. Atomic Crystals. The lattice points accommodate neutral atoms. 
The bond between neutral atoms in a crystal (and also in a molecule) 
is called homopolar (or covalent). The forces of interaction with 
a homopolar bond are also of an electrical (but not of a Coulomb) 
nature. These forces can be explained only on the basis of quantum 
mechanics. 

A homopolar bond is produced by electron pairs. This signifies 
that one electron from each atom participates in setting up a bond 
between two atoms. For this reason, a homopolar bond has a directed 
nature. In a heteropolar bond, each ion acts on all the ions that are 
sufficiently close to it. In a homopolar bond, the action is directed 
toward the atom with which the given one shares an electron pair. 
A homopolar bond can be set up only by valence electrons, whose 
bond to the atom is the weakest. Since each electron can set up 
a bond with only one atom, the number of bonds which a given atom 
can participate in (the number of neighbours with which it can be 
bound) equals its valence. 

Typical examples of atomic crystals are diamond and graphite.
The chemical nature of these two substances is the same (they are
both constructed of carbon atoms), but they differ in the structure
of their crystals. Figure 13.6a shows a diamond lattice, and
Fig. 13.6b a graphite one. This example clearly shows how the crystal
structure of a substance affects its properties.

The typical semiconductors, germanium (Ge) and silicon (Si),
have the same kind of lattice as diamond (a diamond-type lattice).
This lattice is characterized by each atom being surrounded by
four neighbours at an equal distance from it located at the corners
of a regular tetrahedron. Each of the four valence electrons belongs
to an electron pair joining this atom with one of its neighbours.

Fig. 13.6 (a, b)


3. Metallic Crystals. Positive ions of the metal are located at all
the lattice points. Electrons that detached themselves from the
atoms when the ions were formed move chaotically between the latter
similar to the molecules of a gas. These electrons play the part of
a “cement” keeping the positive ions together; otherwise the lattice
would fall apart under the action of the forces of repulsion between
the ions. At the same time, the electrons, in turn, are retained by
the ions within the crystal lattice and cannot leave it.

Fig. 13.7 (a, b, c)

Most metals have lattices of one of three kinds: the cubic
volume-centered (Fig. 13.7a), the cubic face-centered (Fig. 13.7b),
and the so-called hexagonal close-packed lattice (Fig. 13.7c). The
latter is a hexagonal lattice with the ratio $c/a$ equal to
$\sqrt{8/3}$. The cubic face-centered and the hexagonal close-packed
lattices correspond to the closest packing of identical spheres.
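
The statement about closest packing can be illustrated by computing
the fraction of the cell volume filled by identical touching spheres
in each of the three lattices. The sketch below assumes the usual
hard-sphere picture, which the text does not work out: two spheres
per cell touching along the body diagonal in the volume-centered cube,
four spheres touching along a face diagonal in the face-centered cube,
and two spheres of radius $a/2$ in the hexagonal cell. The function
names are hypothetical.

```python
import math


def sphere_volume(r):
    return 4.0 / 3.0 * math.pi * r**3


def packing_volume_centered(a=1.0):
    """Cubic volume-centered (body-centered) lattice: 2 spheres per
    cell, touching along the body diagonal, so 4r = sqrt(3) a."""
    r = math.sqrt(3) * a / 4
    return 2 * sphere_volume(r) / a**3


def packing_face_centered(a=1.0):
    """Cubic face-centered lattice: 4 spheres per cell, touching along
    a face diagonal, so 4r = sqrt(2) a."""
    r = math.sqrt(2) * a / 4
    return 4 * sphere_volume(r) / a**3


def packing_hexagonal(a=1.0, c_over_a=math.sqrt(8.0 / 3.0)):
    """Hexagonal close-packed lattice: 2 spheres of radius a/2 per
    hexagonal cell of volume a^2 c sin(120 deg). Meaningful for touching
    spheres only at the ideal ratio c/a = sqrt(8/3)."""
    r = a / 2
    cell = a * a * math.sin(math.radians(120)) * c_over_a * a
    return 2 * sphere_volume(r) / cell


print(round(packing_volume_centered(), 4))  # 0.6802
print(round(packing_face_centered(), 4))    # 0.7405
print(round(packing_hexagonal(), 4))        # 0.7405
```

The two close-packed lattices give the same filling fraction,
$\pi/(3\sqrt{2})$, which exceeds that of the volume-centered cube.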

4. Molecular Crystals. Molecules with a definite orientation are
located at the lattice points. The forces binding together the
molecules in a crystal are of the same nature as the forces of
attraction between molecules leading to gases deviating from ideal
ones in their properties. This is why they are called van der Waals
forces. Molecular lattices are formed, for example, by the following
substances: H₂, N₂, O₂, CO₂, H₂O. Thus, ordinary ice, and also the
so-called dry ice (solid carbon dioxide) are molecular crystals.


13.4. Defects in Crystals 


Defects in crystals are violations of an ideal crystalline structure.
Such a violation may consist in the absence of an atom at a lattice
point (a vacancy), in the presence of a foreign atom (an impurity
atom) instead of an atom of the given substance (a host atom), or in
the introduction of a surplus atom (host or foreign) into an
interstice. Such defects are called point defects. They cause
violations in the regularity of a lattice extending over a distance
of the order of several periods.

Fig. 13.8

Fig. 13.9

In addition to point defects, there are also defects concentrated 
near certain lines. They are called linear defects or dislocations. 
Defects of this kind violate the regular alternation of the crystal 
planes. The simplest defects of this kind are edge and screw disloca- 
tions. 

An edge dislocation is due to a surplus crystal half-plane inserted
between two adjacent layers of atoms (Fig. 13.8). The edge of this
half-plane forms a dislocation of the given kind. The dislocation
line is the straight line, denoted by the symbol ⊥, at right angles
to the plane of the drawing.

A screw dislocation can be pictured as the result of cutting a crystal
along a half-plane and then shifting the lattice portions on
different sides of the cut relative to each other over a distance of
one period (Fig. 13.9). The internal edge of the cut forms a screw
dislocation (see the dashed line in the figure). A crystal with
a screw dislocation actually consists of a single crystal plane that
is curved along a helical surface (such a surface is called
a helicoid). The dislocation line coincides with the axis of the
screw or helix. The crystal plane is displaced by one period each
time it circumvents this line.

We have considered the two simplest (extreme) kinds of dislocations.
In both cases, the dislocation lines are straight. In the general
case, these lines may be curved.

Defects greatly affect the physical properties of crystals, includ- 
ing their strength. In particular, dislocations are the reason why 
the plastic deformation* of real crystals occurs under the action of 
stresses that are several orders of magnitude smaller than the stress 
calculated for ideal crystals. 

Shear along the atomic layers readily occurs in the monocrystals
of metals. This process should not be pictured as all the atoms of
a layer being displaced simultaneously as a single whole. Actually,
the atoms jump over into their new positions in small groups,
sequentially. Such a sequential movement of the atoms can be
presented as a dislocation movement. The latter requires stresses
much smaller than those needed for the displacement of an entire
atomic layer at a time. Figure 13.10 shows the consecutive steps of
the process occurring in a crystal under the action of the forces
causing the shear. The initially present dislocation moves along the
crystal under the action of the stresses set up in it. This
movement is attended by sequential displacement of the atoms in
the layer above the dislocation relative to the atoms of the layer
under it.

Fig. 13.10 (a, b, c, d)

Dislocation movements are prevented by the presence of other 
defects in a crystal, for example, by the presence of impurity atoms. 
Dislocations are also inhibited when they intersect. If the number of 
dislocations and other defects in a crystal is small, the dislocations 
spread virtually without hindrance. As a result, the resistance to 
shear will not be great. An increase in the density of the dislocations 
and a growth in the concentration of the impurities lead to great 
inhibition of the dislocations and stopping of their spreading. As a 
result, the strength of the material grows. For example, the strength 
of iron is increased by dissolving carbon atoms in it (steel is such 
a solution). 

* A plastic deformation is one that remains after the stress causing
it is removed.

Plastic deformation is attended by the destruction of the crystal
lattice and the formation of a great number of defects preventing
the spreading of the dislocations. This explains why metals are
hardened upon their cold working.

A screw dislocation often appears in the course of the growth of
a crystal from a solution or melt. The capture of an atom by a smooth
flat crystal surface is less favourable energetically and is
therefore less probable than the attachment of an atom to a step
on the surface of a crystal with a screw dislocation. This is why
crystals grow preferentially with a screw dislocation built into
them. New atoms attach themselves to the edge of the step, owing to
which the crystal grows along a spiral.


13.5. Heat Capacity of Crystals 

The location of particles at the points of a crystal lattice corres- 
ponds to a minimum of their mutual potential energy. When par- 
ticles are displaced from their equilibrium position in any direction, 
a force appears that tends to return them to their initial position. 
As a result, oscillations of the particles begin. Oscillation in an arbi- 
trary direction can be represented as the superposition of oscilla- 
tions in three mutually perpendicular directions. Therefore, three 
vibrational degrees of freedom should be ascribed to every particle 
in a crystal. 

We learned in Sec. 11.5 that an energy equal to two halves of
$kT$ (one half in the form of kinetic and the other in the form of
potential energy) falls on the average to each vibrational degree of
freedom. Consequently, an energy equal to $3kT$ falls on the average
to every particle (every atom in an atomic lattice, every ion in an
ionic or metallic lattice)*. We can find the energy of a mole of
a substance in the crystalline state by multiplying the mean energy
of one particle by the number of particles at the points of the
crystal lattice. The latter number coincides with the Avogadro
constant $N_A$ only for chemically simple substances. For a diatomic
substance such as NaCl, the number of particles will be $2N_A$
because a mole of NaCl contains $N_A$ atoms of Na and $N_A$ atoms of
Cl; for a triatomic one it will be $3N_A$, and so on.

Restricting ourselves to a consideration of chemically simple 
substances forming atomic or metallic crystals, we can write the 
following expression for the internal energy of a mole of a substance 
in the crystalline state: 

U m = N A 3kT = 3 RT 

* Matters are more complicated for molecular crystals. The molecules in 
addition to translational oscillations also perform torsional oscillations. Apart 
from this, the atoms oscillate inside the molecules. 

The increment of the internal energy corresponding to elevation
of the temperature by one kelvin, according to Eq. (12.53), equals 
the heat capacity at constant volume. Hence, 

$$C_V = 3R \qquad (13.1)$$

Since the volume of solids changes only slightly when they are heated,
their heat capacity at constant pressure differs insignificantly from
the heat capacity at constant volume; we can therefore assume that
$C_p \approx C_V$ and speak simply of the heat capacity of a solid.

Thus, Eq. (13.1) states that the molar heat capacity of chemically
simple bodies in the crystalline state is the same and equals 3R.

This statement is the content of 
the Dulong and Petit law established 
experimentally. This law is obeyed 
with quite a good approximation 
for many substances at room temperature. There are exceptions to 
this law, however. For example, diamond has a heat capacity of 
only about 0.7 R at room temperature. 
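
As a quick numerical check of Eq. (13.1), the sketch below evaluates 3R and, using the particle count discussed above, the corresponding classical value for a compound with several lattice particles per formula unit. The function name and the numerical value of R are supplied here for illustration only; extending the count to NaCl simply applies the rule "mean energy of one particle times the number of particles" and is not a statement about measured data.

```python
# Illustrative sketch: classical (Dulong and Petit) molar heat capacity.
R = 8.314  # molar gas constant, J/(mol*K)


def classical_molar_heat_capacity(particles_per_formula_unit=1):
    """Return 3R multiplied by the number of lattice particles per formula unit.

    Each particle has three vibrational degrees of freedom, and each degree
    of freedom carries kT on the average, so every particle contributes 3kT.
    """
    return 3.0 * R * particles_per_formula_unit


c_simple = classical_molar_heat_capacity()    # chemically simple crystal
c_diatomic = classical_molar_heat_capacity(2)  # e.g. NaCl: 2 N_A particles per mole

print(f"3R = {c_simple:.1f} J/(mol*K)")
print(f"6R = {c_diatomic:.1f} J/(mol*K)")
```

The first value, about 25 J/(mol·K), is the figure against which room-temperature heat capacities of simple solids are usually compared.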

Moreover, notwithstanding Eq. (13.1), the heat capacity of crystals
depends on the temperature, this dependence having the nature
shown in Fig. 13.11. Near absolute zero, the heat capacity of all
bodies is proportional to $T^3$, and only at a sufficiently high temperature
characteristic of each substance does Eq. (13.1) begin to be
obeyed. For most substances, this already occurs at room temperature.
But for diamond the heat capacity only reaches the value of
3R at a temperature of about 1000 °C.

The strict theory of the heat capacity of solids proposed by A. Ein- 
stein and P. Debye takes into account, first, the quantization of the 
energy of vibrational motion (see Sec. 11.5). Second, the theory takes 
into account that the oscillations of the particles in a crystal lat- 
tice are not independent. This theory, which we shall set out in 
volume 3, is in good agreement with experimental data. In partic- 
ular, for high temperatures, it leads to Eq. (13.1). 
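
The quantum theory itself is left to volume 3. Purely as an illustration of how quantization of the vibrational energy suppresses the heat capacity at low temperatures, the sketch below evaluates the standard single-frequency Einstein expression C = 3R x² eˣ/(eˣ − 1)², where x = Θ/T. This formula is not derived in the present section, and the characteristic temperature taken for diamond (1320 K) is an assumed illustrative value. Because this simple model treats the oscillators as independent, it does not reproduce the T³ law near absolute zero; that requires taking the coupling of the oscillations into account.

```python
import math

R = 8.314  # molar gas constant, J/(mol*K)


def einstein_heat_capacity(T, theta):
    """Molar heat capacity in the single-frequency Einstein model.

    T     -- temperature, K
    theta -- characteristic (Einstein) temperature of the substance, K
    """
    x = theta / T
    # expm1(x) = exp(x) - 1, computed accurately for small x (high T)
    return 3.0 * R * x * x * math.exp(x) / math.expm1(x) ** 2


THETA_DIAMOND = 1320.0  # K, assumed illustrative value

for T in (100.0, 300.0, 1300.0, 5000.0):
    ratio = einstein_heat_capacity(T, THETA_DIAMOND) / R
    print(f"T = {T:6.0f} K   C/R = {ratio:.2f}")
```

Under these assumptions the ratio C/R at 300 K comes out close to 0.7, in line with the room-temperature figure for diamond quoted above, and it approaches 3 at high temperatures, as Eq. (13.1) requires.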


Fig. 13.11



CHAPTER 14 THE LIQUID STATE 


14.1. The Structure of Liquids 

The liquid state, occupying an intermediate position between 
gases and crystals, combines some features of both of these states. 
In particular, liquids, like crystalline substances, are characterized 
by having a definite volume. At the same time, a liquid, like a gas, 
takes on the shape of the vessel containing it. Further, the crystal- 
line state is characterized by the ordered arrangement of the par- 
ticles (atoms or molecules), whereas from this viewpoint, complete 
chaos reigns in gases. As shown by radiographic studies, liquids also 
occupy an intermediate position with respect to the nature of arrange- 
ment of their particles. The so-called short-range order is observed 
in the arrangement of liquid particles. This signifies that with re- 
spect to any particle, the arrangement of its closest neighbours is 
ordered. But as we move farther and farther away from a given par- 
ticle, the arrangement of other particles relative to it becomes less 
and less ordered, and order in the arrangement of the particles van- 
ishes quite rapidly. In crystals, there is long-range order: the ordered 
arrangement of particles with respect to any particle is observed 
within the limits of an appreciable volume. 

The presence of short-range order in liquids is the reason why 
their structure is called quasicrystalline (crystal-like). 

Owing to the absence of long-range order in them, liquids, with 
a few exceptions, do not display the anisotropy characteristic of 
crystals with their regular arrangement of the particles. Liquids with 
elongated molecules display an identical orientation of their mole- 
cules within a considerable volume, which results in anisotropy of 
their optical and some other properties. Such liquids are known as 
liquid crystals. Only the orientation of the molecules is ordered 
in them, while the mutual arrangement of the molecules, as in con- 
ventional liquids, does not display long-range order. 

Because of the intermediate position of liquids, the liquid state is
especially complicated as regards its properties.
Therefore, its theory has been developed to a much smaller extent 
than that of the crystalline and gaseous states. To date there is no 
complete and universally recognized theory of liquids. Considerable 
merit in developing a number of problems of the theory of the liquid 
state belongs to the Soviet scientist Yakov Frenkel (1894-1952). 

Frenkel postulates that the thermal motion in liquids has the 
following nature. Each molecule during a certain time oscillates 
about a definite position of equilibrium. The molecule changes its
place of equilibrium from time to time, moving in a jump to a new 
position that is at a distance from the previous one of the order of 
the size of the molecules themselves. The molecules thus move only 
slowly inside a liquid, spending part of their time near definite 
places. As picturesquely expressed by Frenkel, the molecules wan- 
der throughout the entire volume of a liquid, leading a nomadic mode 
of life in which brief removals are replaced by relatively long periods 
of settled life. The lengths of these stops vary quite considerably and 
chaotically alternate with one another, but the mean duration of 
oscillations about a single equilibrium position is a definite quan- 
tity for each liquid that sharply diminishes with increasing tem- 
perature. In this connection, elevation of the temperature is attended 
by a great growth in the mobility of the molecules, and this, in 
turn, results in diminishing of the viscosity of the liquid. 

Solids exist that in many respects are closer to liquids than to crys- 
tals. Such substances, called amorphous, do not display anisotropy. 
Only short-range order is encountered in the arrangement of their 
particles. The transition from an amorphous solid to a liquid when 
such a substance is heated occurs continuously, whereas the transi- 
tion from a crystal to a liquid occurs in a jump (this will be treated 
in greater detail in Sec. 15.6). All this gives us grounds to consider 
amorphous solid substances as supercooled liquids whose particles 
owing to the greatly increased viscosity have a limited mobility. 

A typical example of an amorphous solid is glass. Amorphous 
substances also include resins and bitumens. 


14.2. Surface Tension 

The molecules of a liquid are so close to one another that the forces
of attraction between them have a considerable value. Since the
interaction rapidly falls off with the distance, beginning from a certain
distance the forces of attraction between the molecules may be
disregarded. This distance r, as we already know (see Sec. 10.13),
is called the radius of molecular action, and a sphere of radius r is
called a sphere of molecular action. The radius of molecular action 
has a magnitude of the order of several effective diameters of a 
molecule. 

Each molecule is attracted by all its neighbour molecules within 
the limits of the sphere of molecular action whose centre coincides 
with the given molecule. The resultant of all these forces for a mole- 
cule that is at a distance from the surface of the liquid exceeding r 
evidently equals zero on the average (Fig. 14.1). Matters are differ- 
ent if a molecule is at a distance less than r from the surface. Since 
the density of the vapour (or gas with which the liquid has an interface)
is much smaller than that of the liquid, the part of the sphere
of molecular action protruding beyond the limits of the liquid will 
be filled with fewer molecules than the remaining part of the sphere. 
As a result, every molecule in a surface layer of thickness r will expe- 
rience a force directed into the liquid. The magnitude of this force 
grows in a direction from the inner to the outer boundary of the 
layer. 

The transition of a molecule from the bulk of a liquid to its surface 
layer is associated with the need to do work against the forces acting 
in the surface layer. This work is done by the molecule at the expense 
of its store of kinetic energy and increases the potential energy of 
the molecule, just as the work done by a body flying upward against the
forces of the Earth’s attraction increases the potential energy of the
body. When the molecule returns into the bulk of the liquid, the
potential energy which the molecule had in the surface layer
transforms into the kinetic energy of the molecule.

Thus, molecules in a surface layer have an additional potential energy.
The surface layer as a whole has an
additional energy forming part of the internal energy of the liquid.

Since the equilibrium position corresponds to a minimum of poten- 
tial energy, a liquid left to itself will take on a shape having the 
minimum surface area, i.e. the shape of a sphere. What we usually 
observe are not liquids “left to themselves”, but liquids sub- 
jected to the action of the Earth’s gravitational forces. In this 
case, the liquid takes on a shape corresponding to a minimum of the 
total energy — that in the field of the gravitational forces and the 
surface energy. 

When the dimensions of a body increase, its volume grows as the 
cube of the linear dimensions, and its surface area only as the square 
of these dimensions. Therefore, the energy in the gravitational field 
proportional to the volume of a body changes more rapidly with 
increasing dimensions of the body than its surface energy. In small 
drops of a liquid, the surface energy plays the predominant part,
and as a result the drops have a shape close to a spherical one. Large 
drops of a liquid flatten under the action of gravitational forces 
notwithstanding the fact that their surface energy grows. Large 
bodies of a liquid take on the shape of the vessel containing them 
and a horizontal free surface. 
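
The scaling argument above can be made concrete with a rough order-of-magnitude sketch. It compares a gravitational energy scale (the weight of a drop times its radius) with the surface energy of a sphere of the same radius. The numerical values for water and the choice of "weight times radius" as the gravitational scale are assumptions made here for illustration.

```python
import math

SIGMA = 0.073  # N/m, assumed surface tension of water near room temperature
RHO = 1000.0   # kg/m^3, density of water
G = 9.81       # m/s^2, free-fall acceleration


def energy_ratio(radius):
    """Rough ratio of gravitational to surface energy for a drop.

    The gravitational scale grows as radius**4 (volume times radius),
    the surface energy only as radius**2, so the ratio grows as radius**2.
    """
    volume = 4.0 / 3.0 * math.pi * radius ** 3
    area = 4.0 * math.pi * radius ** 2
    return (RHO * G * volume * radius) / (SIGMA * area)


for r in (1e-4, 1e-3, 1e-2, 1e-1):
    print(f"r = {r:6.4f} m   E_grav / E_surf ~ {energy_ratio(r):.3g}")
```

With these values the ratio passes through unity at a radius of a few millimetres, which is the size at which water drops begin to flatten noticeably.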


Fig. 14.1 


The presence of surface energy causes a liquid to tend to reduce 
its surface area. The liquid behaves as if it were confined inside a
stretched elastic film tending to contract. It must be borne in
mind that there is actually no film confining a liquid from the outside.
The surface layer consists of the same molecules as the bulk of
the liquid, and the interaction between the molecules in the surface
layer is of the same nature as in the interior. The only difference is that
the molecules in the surface layer have an additional energy in comparison
with those in the bulk of the liquid.

Let us mentally separate a part of the 
surface of a liquid confined within a closed 
contour. The tendency of this portion to 
contract results in that it acts on the 
portions bordering on it with forces dis- 
tributed over the entire contour (according 
to Newton’s third law, the external por- 
tions of the surface layer act on the portion 
of the surface being considered with forces 
of the same magnitude, but opposite in 
direction). These forces are called forces of 
surface tension. The force of surface ten- 
sion is directed along a tangent to the sur- 
face of the liquid perpendicularly to the 
portion of the contour it is acting upon. 

Let us denote the force of surface tension per unit of length of
a contour by $\sigma$. This quantity is defined as the surface tension. It
is measured in newtons per metre (in the SI system) or in dynes per 
centimetre (in the cgs system). 

Assume that we have a rectangular frame with a movable side 
confining a film of a liquid (Fig. 14.2). A film is a thin flat volume 
of liquid confined on both sides by a surface layer (see Fig. 14.2b, in
which a cross section of the frame is shown). Owing to the tendency
of the surface layer to contract, the film will exert a force equal to
$2\sigma l$ on the movable side. For the latter to be in equilibrium, an
external force $f$ must be applied to it that equals the force tensioning
the film, i.e. $2\sigma l$. Let us assume that the movable side has moved
very slowly in the direction of action of the force $f$ over the very small
distance $dx$. This process is attended by the liquid above the movable
side doing the work $d'A = -2\sigma l\,dx = -\sigma\,dS$, where $dS$ is the
increment of the area of the surface layer. With such an increase
of the surface, an additional number of molecules will pass from the
interior of the liquid to the surface layer, losing their velocity.
Therefore, if the process proceeded adiabatically, the liquid would cool
slightly. We assumed, however, that the process goes on very slowly 
(reversibly), owing to which the temperature of the film remains 
constant as a result of the inflow of heat from the surroundings. 
Thus, the process will go on isothermally. 


We established in Sec. 12.6 that the work done in a reversible
isothermal process equals the decrement of the free energy [see 
Eq. (12.56)]. We can therefore write that 

$$d'A = -\sigma\,dS = -dF$$

The result obtained signifies that upon an isothermal increase in
the area of the surface layer by $dS$ the free energy of the liquid grows
by $dF = \sigma\,dS$. It thus follows that the surface tension $\sigma$ is the
additional free energy which a unit area of a surface layer has.
Accordingly, $\sigma$ can be expressed not only in newtons per metre (or dynes
per centimetre), but also in joules per square metre (or in ergs per
square centimetre).
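
The equivalence of the two descriptions, force per unit length and free energy per unit area, can be checked with a few lines of arithmetic for the frame with a movable side. In the sketch below the surface tension of the soap solution and the dimensions of the frame are assumed values chosen only for illustration.

```python
def film_force(sigma, side_length):
    """Force exerted on the movable side by a film with two surfaces: 2*sigma*l."""
    return 2.0 * sigma * side_length


def free_energy_increase(sigma, side_length, dx):
    """dF = sigma * dS, where dS = 2*l*dx counts both surfaces of the film."""
    dS = 2.0 * side_length * dx
    return sigma * dS


sigma = 0.040  # N/m, assumed value for a soap solution
l = 0.05       # m, length of the movable side (assumed)
dx = 0.01      # m, displacement of the movable side (assumed)

f = film_force(sigma, l)
dF = free_energy_increase(sigma, l, dx)

print(f"force on the movable side: {f:.4f} N")
print(f"work of the external force f*dx: {f * dx:.2e} J")
print(f"increase of free energy sigma*dS: {dF:.2e} J")
```

The work done by the external force in stretching the film equals the growth of the free energy of the surface layer, as the relation dF = σ dS requires.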

Impurities greatly affect the magnitude of the surface tension. 
For example, soap dissolved in water reduces its surface tension 
almost one-and-a-half times. When NaCl is dissolved in water, on the 
contrary, the surface tension grows. 

With elevation of the temperature, the difference between the 
densities of a liquid and its saturated vapour diminishes (see 
Sec. 15.4). In this connection, the surface tension also decreases. At 
the critical temperature (its definition is given in Sec. 15.4), $\sigma$
vanishes.


14.3. Pressure under a Curved Liquid Surface

Let us consider the surface of a liquid resting on a flat contour
(Fig. 14.3a). If the surface of the liquid is not flat, its tendency to
decrease its area leads to the appearance of a pressure additional to that
exerted on a liquid with a flat surface. When the surface is convex,
this additional pressure is positive (Fig. 14.3b), and when it is concave,
this pressure is negative (Fig. 14.3c). In the last case, the surface
layer, tending to diminish, stretches the liquid.

Fig. 14.3

The magnitude of the additional pressure must obviously grow
with an increasing surface tension $\sigma$ and surface curvature. Let us
calculate the additional pressure for a spherical surface of a liquid.
To do this, we shall mentally cut a spherical drop of a liquid with
a diametral plane into two hemispheres (Fig. 14.4). Owing to surface
tension, both hemispheres are attracted to each other with a force
equal to

$$l\sigma = 2\pi R\sigma$$


This force presses the two hemispheres against each other over the
surface $S = \pi R^2$ and, consequently, produces the additional pressure

$$\Delta p = \frac{2\pi R\sigma}{\pi R^2} = \frac{2\sigma}{R} \qquad (14.1)$$


The curvature of a spherical surface is the same everywhere and
is determined by the radius of the sphere $R$. It is obvious that the
smaller the radius $R$, the greater is the curvature of a spherical surface.
It is customary practice to characterize the curvature of an
arbitrary surface by the so-called mean curvature, which may differ
for various points of a surface.

The mean curvature is determined through the curvature of nor- 
mal sections. A normal section of a surface at a certain point is de- 
fined as the line of intersection of this surface with a plane passing 
through a normal to the surface at the point being considered. For 
a sphere, any normal section is a circle of radius R (here R is the 
radius of the sphere). The quantity H = 1/R gives the curvature of 
a sphere. In the general case, different normal sections passing through 
the same point have different curvatures. It is proved in geometry 
that the half-sum of the reciprocal radii of curvature

$$H = \frac{1}{2}\left(\frac{1}{R_1} + \frac{1}{R_2}\right) \qquad (14.2)$$

for any pair of mutually perpendicular normal sections has the
same value. It is exactly this quantity that is the mean curvature of 
a surface at a given point. 

The radii $R_1$ and $R_2$ in Eq. (14.2) are algebraic quantities. If
the centre of curvature of a normal section is under a given surface,
the relevant radius of curvature is positive; if the centre of curvature
is above the surface, the radius of curvature is negative (Fig. 14.5).
Thus, a curved surface can have a mean curvature equal to zero.
For this purpose, the radii of curvature $R_1$ and $R_2$ must be identical
in magnitude and opposite in sign.

For a sphere, we have $R_1 = R_2 = R$, so that by Eq. (14.2) $H = 1/R$.
Substituting $H$ for $1/R$ in Eq. (14.1), we get

$$\Delta p = 2H\sigma \qquad (14.3)$$

The French scientist Pierre Laplace (1749-1827) proved that 
Eq. (14.3) holds for a surface of any shape if by H we understand the 
mean curvature of a surface at the point under which the additional 
pressure is being determined. Introducing the expression (14.2) 
for the mean curvature into Eq. (14.3), we get a formula for the 
additional pressure under an arbitrary surface: 

$$\Delta p = \sigma\left(\frac{1}{R_1} + \frac{1}{R_2}\right) \qquad (14.4)$$

It is called the Laplace formula. 

The additional pressure given by Eq. (14.4) causes the level of 
a liquid in a narrow tube (capillary) to change. This is why it is 
sometimes called the capillary pressure. 
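
A short sketch may help to see how the Laplace formula behaves for differently shaped surfaces. It evaluates the mean curvature and the additional pressure for a sphere, a cylinder, and a saddle-like surface with radii of equal magnitude and opposite sign. The function names and the numerical values (water, a radius of one micrometre) are assumptions introduced here for illustration.

```python
import math


def mean_curvature(r1, r2):
    """Half-sum of the reciprocal radii of two perpendicular normal sections."""
    return 0.5 * (1.0 / r1 + 1.0 / r2)


def laplace_pressure(sigma, r1, r2):
    """Additional pressure under a curved surface: 2 * H * sigma."""
    return 2.0 * mean_curvature(r1, r2) * sigma


SIGMA = 0.073  # N/m, assumed surface tension of water
R = 1.0e-6     # m, assumed radius of curvature

dp_sphere = laplace_pressure(SIGMA, R, R)           # R1 = R2 = R
dp_cylinder = laplace_pressure(SIGMA, R, math.inf)  # one section is a straight line
dp_saddle = laplace_pressure(SIGMA, R, -R)          # radii of opposite sign

print(f"sphere:   {dp_sphere:.3g} Pa")
print(f"cylinder: {dp_cylinder:.3g} Pa")
print(f"saddle:   {dp_saddle:.3g} Pa")
```

The spherical case reproduces Eq. (14.1), the cylindrical surface gives half that value, and the surface with zero mean curvature produces no additional pressure at all.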


14.4. Phenomena on Liquid-Solid Interface

Everything said in Sec. 14.2 about the special conditions of the
molecules in a surface layer also applies completely to solids.
Hence, solids, like liquids, have a surface tension. 

Fig. 14.7


When considering phenomena on the interface between various 
media, we must not forget that the surface energy of a liquid or solid 
depends not only on the properties of the given liquid or solid, but 
also on the properties of the substance with which they have a common
boundary. Strictly speaking, we must consider the total surface
energy $\sigma_{12}$ of both substances in contact with each other (Fig. 14.6).
Only if one of the substances is gaseous, does not react chemically
with the other substance, and has a poor solubility in it, can we 
simply speak of the surface energy (or the surface tension) of the 
second liquid or solid. 

If three substances, namely, a solid, a liquid, and a gas, are in
direct contact with one another (Fig. 14.7), then the entire system
takes on a configuration corresponding to the minimum of the total
energy (surface, in the field of forces of gravity, etc.). In particular,
the contour along which the three substances are in contact is arranged
on the surface of the solid so that the sum of the projections of
all the surface tension forces applied to each contour element onto the
direction in which the contour element can move (i.e. onto a direction
tangent to the surface of the solid) equals zero. It can be seen from
Fig. 14.7 that the condition of equilibrium of a contour element of
length $\Delta l$ is

$$\Delta l\,\sigma_{s,g} = \Delta l\,\sigma_{s,l} + \Delta l\,\sigma_{l,g}\cos\theta \qquad (14.5)$$

where $\sigma_{s,g}$, $\sigma_{s,l}$ and $\sigma_{l,g}$ are the surface tensions at the solid-gas,
solid-liquid, and liquid-gas interfaces.

Fig. 14.8

The angle $\theta$ measured inside the liquid between tangents to
the surface of the solid and the surface of the liquid is called the
contact angle. In accordance with Eq. (14.5), we have

$$\cos\theta = \frac{\sigma_{s,g} - \sigma_{s,l}}{\sigma_{l,g}} \qquad (14.6)$$

The contact angle is determined by Eq. (14.6) only provided that

$$\frac{|\sigma_{s,g} - \sigma_{s,l}|}{\sigma_{l,g}} \le 1 \qquad (14.7)$$

If this condition is not observed, i.e. $|\sigma_{s,g} - \sigma_{s,l}| > \sigma_{l,g}$, then
equilibrium cannot set in at any value of $\theta$. This occurs in two cases.

1. $\sigma_{s,g} > \sigma_{s,l} + \sigma_{l,g}$. No matter how small the angle $\theta$ is, the
force $\sigma_{s,g}$ overbalances the other two (Fig. 14.8a). In this case, the
liquid flows unlimitedly over the surface of the solid: complete
wetting takes place. The replacement of a solid-gas interface with
two interfaces, solid-liquid and liquid-gas ones, is advantageous
from the energy viewpoint. The contact angle is zero in complete
wetting.

2. $\sigma_{s,l} > \sigma_{s,g} + \sigma_{l,g}$. No matter how close to $\pi$ the angle $\theta$ is,
the force $\sigma_{s,l}$ overbalances the other two (Fig. 14.8b). In this case,
the liquid-solid interface contracts into a point, and the liquid separates
from the surface of the solid: complete non-wetting takes place.
The replacement of a solid-liquid interface with two interfaces,
solid-gas and liquid-gas ones, is advantageous from the energy
viewpoint. In complete non-wetting, the contact angle is $\pi$.

When condition (14.7) is observed, the contact angle may be acute
or obtuse depending on the relation between $\sigma_{s,g}$ and $\sigma_{s,l}$. If $\sigma_{s,g}$
is greater than $\sigma_{s,l}$, then $\cos\theta > 0$ and the angle $\theta$ is acute (Fig. 14.9a).
In this case, partial wetting occurs. If $\sigma_{s,g}$ is smaller than $\sigma_{s,l}$, then
$\cos\theta < 0$ and the angle $\theta$ is obtuse (Fig. 14.9b). In this case, partial
non-wetting occurs.

Fig. 14.9
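
The four regimes distinguished above (complete wetting, partial wetting, partial non-wetting, complete non-wetting) follow mechanically from the three surface tensions. The sketch below classifies a solid-liquid-gas combination according to Eqs. (14.6) and (14.7). The function name and all numerical values are hypothetical and serve only to exercise each branch.

```python
import math


def contact_angle(sigma_sg, sigma_sl, sigma_lg):
    """Return (regime, theta in radians) from the three surface tensions.

    sigma_sg, sigma_sl, sigma_lg are the solid-gas, solid-liquid and
    liquid-gas surface tensions in the same units.
    """
    c = (sigma_sg - sigma_sl) / sigma_lg  # right-hand side of Eq. (14.6)
    if c > 1.0:
        return "complete wetting", 0.0
    if c < -1.0:
        return "complete non-wetting", math.pi
    theta = math.acos(c)
    if c > 0.0:
        return "partial wetting", theta
    if c < 0.0:
        return "partial non-wetting", theta
    return "right contact angle", theta  # borderline case, theta = pi/2


# Hypothetical sets of surface tensions, N/m
samples = [
    (0.50, 0.10, 0.073),
    (0.10, 0.06, 0.073),
    (0.06, 0.10, 0.073),
    (0.05, 0.60, 0.470),
]

for s_sg, s_sl, s_lg in samples:
    regime, theta = contact_angle(s_sg, s_sl, s_lg)
    print(f"{regime:22s} theta = {math.degrees(theta):6.1f} deg")
```

Treating the cases outside condition (14.7) separately avoids taking the arccosine of a number whose magnitude exceeds unity, which is the computational counterpart of the statement that no equilibrium angle exists there.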

Non-wetting may result in interesting phenomena. It is general
knowledge that a needle or safety razor blade coated with grease
can float on the surface of water. This phenomenon, curious at
first sight, is very simple to explain on the basis of energy considerations.
The greased surface of steel is not wetted by water; the steel-water
interface has a much greater energy than the steel-air or air-water
ones. The complete submersion of a needle into water is attended by
an increase in the surface energy from $S\sigma_{s,g}$ (steel-air) to the value
$S\sigma_{s,l}$ (steel-water), where $S$ is the surface area of the needle. The
change in the surface energy upon submersion is described by the
curve $E_{sur}$ shown in Fig. 14.10. The symbol $h$ stands for the height
of the needle above the bottom of the vessel, $h_0$ is the height of the
surface of the liquid above the bottom of the vessel. The dependence
of the potential energy of the needle in the field of the Earth’s
gravitation $E_{gr}$ on $h$ has the form of a straight line passing through the
origin of coordinates. The total energy $E_{tot} = E_{sur} + E_{gr}$ has a
minimum when $h = h_0$. This is exactly what permits the needle to
float on the surface of the water. If we press on the needle and
submerge it to a depth such that the total energy passes through its
maximum and begins to decrease, then the needle will submerge further
by itself and sink.

The possibility of “carrying water in a sieve” is explained in a similar way.
If water does not wet a sieve (this can be achieved by coating the wires
forming the sieve with paraffin) and the layer of water is not very thick,
then a slight displacement of the water level downward (Fig. 14.11)
will be attended by an increase in the surface energy exceeding in
magnitude the decrease in the energy in the field of gravitational
forces. Hence, the water will be retained in the sieve and will not
spill out.

Fig. 14.11


14.5. Capillary Phenomena 

The existence of the contact angle leads to curvature of the surface
of a liquid near the walls of the vessel containing it. In a narrow tube
(capillary*) or in a narrow gap between two walls, the entire surface
is curved. If the liquid wets the walls, the surface is concave, and if
it does not wet them, the surface is convex (Fig. 14.12). Such curved
surfaces of a liquid are called meniscuses.

If one end of a capillary is 
immersed in a liquid poured 
into a broad vessel, then the 
pressure under the curved sur- 
face in the capillary will differ 
from that under the flat sur- 
face in the broad vessel by the 
amount $\Delta p$ determined by Eq. (14.4). As a result, the level of the
liquid in the capillary will be higher than in the vessel if the liquid 
wets it, and lower if the liquid does not wet it. 

The change in the height of the liquid level in narrow tubes or 
gaps has been named capillarity. In the broad meaning of the term, 
capillary phenomena are understood to include all the phenomena 
due to the existence of surface tension. In particular, the pressure
expressed by Eq. (14.4) and due to surface tension is called, as we
have already indicated, capillary pressure.

* The Latin capillus means hair. A capillary is a “tube as thin as a hair”.

A difference $h$ sets in between the level of a liquid in a capillary and
in a broad vessel such that the hydrostatic pressure $\rho gh$ is balanced
by the capillary pressure $\Delta p$:

$$\rho g h = \frac{2\sigma}{R} \qquad (14.8)$$



In this equation, $\sigma$ is the surface tension on the liquid-gas interface,
and $R$ is the radius of curvature of the meniscus. The latter can be
expressed through the contact angle $\theta$ and the radius of the capillary $r$.
Indeed, examination of Fig. 14.12 shows that $R = r/\cos\theta$. Using
this value in Eq. (14.8) and solving the equation obtained for $h$,
we arrive at the equation

$$h = \frac{2\sigma\cos\theta}{\rho g r} \qquad (14.9)$$

In accordance with the fact that a wetting liquid rises in a capillary,
while a non-wetting liquid sinks in it, Eq. (14.9) gives a positive
$h$ for $\theta < \pi/2$ (because $\cos\theta > 0$), and a negative
$h$ for $\theta > \pi/2$ (because $\cos\theta < 0$).
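
To get a feeling for the magnitudes involved, Eq. (14.9) can be evaluated for two familiar cases: a liquid that wets the tube completely and one that does not wet it. In the sketch below the property values for water and mercury, the contact angles, and the capillary radius are assumed round numbers, not data taken from this book.

```python
import math


def capillary_rise(sigma, theta, rho, r, g=9.81):
    """Height difference h = 2*sigma*cos(theta) / (rho*g*r), Eq. (14.9).

    sigma -- liquid-gas surface tension, N/m
    theta -- contact angle, rad
    rho   -- density of the liquid, kg/m^3
    r     -- radius of the capillary, m
    """
    return 2.0 * sigma * math.cos(theta) / (rho * g * r)


r = 0.5e-3  # m, assumed radius of the capillary

# Assumed values: water wets glass completely (theta = 0)
h_water = capillary_rise(0.073, 0.0, 1000.0, r)

# Assumed values: mercury does not wet glass (theta about 140 degrees)
h_mercury = capillary_rise(0.47, math.radians(140.0), 13600.0, r)

print(f"water:   h = {h_water * 1000:6.1f} mm")
print(f"mercury: h = {h_mercury * 1000:6.1f} mm")
```

With these assumptions water rises by about 3 cm, whereas the mercury level is lowered by roughly 1 cm, the sign of h following that of cos θ.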

In deriving Eq. (14.9), we assumed that the meniscus has a spher- 
ical shape. The equation for h can also be obtained on the basis of 
energy considerations, and there is no need to make a special assump- 
tion on the shape of the meniscus. The equilibrium position of the 
meniscus will correspond to a minimum energy E of the liquid-capil- 
lary system. This energy consists of the surface energy on the liquid- 
wall, liquid-gas, and wall-gas interfaces, and also of the potential 
energy of the liquid in the field of the Earth’s gravitation. 

Let us find the increment of the energy $dE$ corresponding to an
increment $dh$ of the height to which a liquid rises in a capillary.
When the height grows by $dh$, the surface area of contact of the liquid
with the wall of the capillary increases by $2\pi r\,dh$, owing to which
the energy receives an increment of $2\pi r\sigma_{s,l}\,dh$. Simultaneously, the
surface area of contact between the wall and the gas diminishes,
which is attended by an increment of the energy of $-2\pi r\sigma_{s,g}\,dh$.
The potential energy in the field of the Earth’s gravitation acquires
an increment equal to the force of gravity acting on the hatched volume
of the liquid (Fig. 14.13) multiplied by $h$, i.e. equal to
$g\rho\pi r^2 h\,dh$. We may disregard the change in the level of the liquid
in the broad vessel. Thus,

$$dE = 2\pi r(\sigma_{s,l} - \sigma_{s,g})\,dh + g\rho\pi r^2 h\,dh$$
Hence,

$$\frac{dE}{dh} = 2\pi r(\sigma_{s,l} - \sigma_{s,g}) + g\rho\pi r^2 h$$

Equating this derivative to zero, we obtain the condition of equi- 
librium, from which it follows that 

$$h = \frac{2(\sigma_{s,g} - \sigma_{s,l})}{\rho g r} \qquad (14.10)$$

According to Eq. (14.6), $\sigma_{s,g} - \sigma_{s,l} = \sigma_{l,g}\cos\theta$. Making this
substitution in Eq. (14.10) and using simply $\sigma$ instead of $\sigma_{l,g}$, we
get Eq. (14.9).
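
The energy argument can also be verified numerically. The sketch below integrates the expression for dE to obtain the energy as a function of h (counted from h = 0), locates its minimum by a golden-section search, and compares the result with Eq. (14.10). The surface tensions, the density, and the radius are hypothetical values, and the search routine is merely one convenient way of finding the minimum.

```python
import math


def total_energy(h, r, sigma_sg, sigma_sl, rho, g=9.81):
    """Energy of the liquid-capillary system relative to h = 0.

    Obtained by integrating dE = 2*pi*r*(sigma_sl - sigma_sg)*dh
    + g*rho*pi*r**2*h*dh from 0 to h.
    """
    surface = 2.0 * math.pi * r * (sigma_sl - sigma_sg) * h
    gravity = 0.5 * g * rho * math.pi * r ** 2 * h ** 2
    return surface + gravity


def golden_section_minimum(f, lo, hi, tol=1e-7):
    """Locate the minimum of a unimodal function f on [lo, hi]."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    while b - a > tol:
        c = b - inv_phi * (b - a)
        d = a + inv_phi * (b - a)
        if f(c) < f(d):
            b = d  # the minimum lies in [a, d]
        else:
            a = c  # the minimum lies in [c, b]
    return 0.5 * (a + b)


# Hypothetical values for illustration
r = 0.5e-3        # m
sigma_sg = 0.100  # N/m
sigma_sl = 0.027  # N/m
rho = 1000.0      # kg/m^3
g = 9.81          # m/s^2

h_numeric = golden_section_minimum(
    lambda h: total_energy(h, r, sigma_sg, sigma_sl, rho, g), 0.0, 0.1
)
h_formula = 2.0 * (sigma_sg - sigma_sl) / (rho * g * r)

print(f"minimum of E(h):  h = {h_numeric:.5f} m")
print(f"Eq. (14.10):      h = {h_formula:.5f} m")
```

The two values agree to within the tolerance of the search, and nowhere in the calculation was an assumption made about the shape of the meniscus.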




CHAPTER 15 PHASE EQUILIBRIA AND TRANSITIONS


15.1. Introduction 

By a phase in thermodynamics is meant a combination of homogeneous
parts of a system having identical properties. Let us explain
what is meant by this definition using the following examples.
A closed vessel contains water and a mixture of air and water vapour
above it. Here we are dealing with a system consisting of two phases:
one is formed by the water, and the other by the mixture of air 
and water vapour. If we add a few pieces of ice to the water, then 
all these pieces form a third phase. Different crystalline modifica- 
tions of a substance are also different phases. For instance, diamond 
and graphite are different solid phases of carbon. 

In definite conditions, different phases of the same substance can 
be in equilibrium with one another while being in contact. The equi- 
librium of two phases is possible only within a definite temperature 
interval, and a quite definite pressure p at which equilibrium is pos- 
sible corresponds to each value of the temperature T. Thus, the equi- 
librium states of two phases will be depicted in a p-T diagram by 
the line 

$$p = f(T) \qquad (15.1)$$

Three phases of a single substance (solid, liquid, and gaseous, or 
liquid and two solid phases) can be in equilibrium only at single 
values of the temperature and pressure which in the p-T diagram 
correspond to what we call the triple point. This point is at the inter- 
section of the equilibrium curves for the phases taken in pairs. 

It is proved in thermodynamics, in agreement with experiments, 
that the equilibrium of more than three phases of the same substance 
is impossible. 

The transition from one phase to another is usually attended 
by the absorption or liberation of a certain amount of heat called 
the latent heat of transition or simply the heat of transition. Such 
transitions are called phase transitions of the first kind. There are 
also transitions from one crystalline modification to another that
are not associated with the absorption or liberation of heat. These 
transitions are called phase transitions of the second kind*. We 
shall consider only transitions of the first kind. 

* Phase transitions of the second kind are not limited to transitions
between different crystalline modifications. They include the transition to a
superconductive state performed in the absence of a magnetic field, and also the
transition between the two liquid phases of helium called helium-I and helium-II.


15.2. Evaporation and Condensation

Liquids and solids at any temperature contain a certain number 
of molecules whose energy is sufficient for them to overcome the 
attraction to other molecules, escape from the surface of the liquid 
or solid, and pass over into the gaseous phase. The transition of a
liquid into the gaseous phase is called vaporization or evaporation, 
and the transition of a solid into the gaseous phase 
is called sublimation. 

All solids sublime to some extent without any 
exception. In some substances such as carbon di- 
oxide, sublimation proceeds at an appreciable rate; 
in other substances, it is so insignificant at ordinary 
temperatures that it is practically not detected. 

In evaporation and sublimation, the fastest molecules leave a body.
As a result, the mean energy of the remaining molecules diminishes, and the
body cools. To maintain the temperature of an evaporating (or subliming)
body at a constant value, heat must continuously be supplied to it. The heat
$L$ that must be supplied to a unit mass of a substance to transform
it into a vapour at the same temperature which the substance had
prior to evaporation is defined as the specific heat of vaporization
(or sublimation).

Fig. 15.1

In condensation, the heat spent on evaporation is returned: the liquid
(or solid) formed upon condensation is heated.

Let us consider the setting in of equilibrium between a liquid and 
its vapour. We shall take a sealed vessel partly filled with a liquid 
(Fig. 15.1) and assume that initially the substance was completely 
removed from the space above the liquid. Owing to evaporation, the 
space above the liquid will become filled with molecules. The mole- 
cules that passed into the gaseous phase move chaotically and col- 
lide with the surface of the liquid. Some of these collisions will be 
attended by transition of the molecules into the liquid phase. The 
number of molecules passing in unit time into the liquid phase is 
evidently proportional to the number of molecules colliding with 
its surface. This number, in turn, is proportional to $n \langle v \rangle$ [see
Eq. (11.23)], i.e. grows with increasing pressure p. Hence, evaporation
is accompanied by the reverse process of transition of the molecules 
from the gaseous to the liquid phase, its intensity growing as the 
density of the molecules in the space over the liquid increases. 
When a certain quite definite (for the given temperature) pressure 
is reached, the number of molecules escaping from the liquid will 
become equal to that returning to it. Beginning from this moment, 
the density of the vapour stops changing. Mobile equilibrium sets 
in between the liquid and its vapour, and it will exist until the
volume or the temperature of the system changes. A vapour in equi- 
librium with its liquid is called saturated. The pressure at which 
equilibrium is observed is called the saturated vapour pressure. 
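
The setting in of mobile equilibrium can be illustrated by a minimal
numerical sketch. It assumes, purely for illustration, that molecules
leave the liquid at a constant rate fixed by the temperature, while the
rate of their return is proportional to the vapour density n, as argued
above. The function name and all the numbers (in arbitrary units) are
hypothetical.

```python
def vapour_density_history(escape_rate, return_coeff, dt, steps):
    """Integrate dn/dt = escape_rate - return_coeff * n starting from n = 0.

    escape_rate  : molecules entering the vapour per unit volume and time
                   (assumed constant at a fixed temperature)
    return_coeff : share of the vapour density returning to the liquid per
                   unit time (stands in for the factor proportional to n<v>)
    """
    n = 0.0
    history = [n]
    for _ in range(steps):
        n += (escape_rate - return_coeff * n) * dt  # explicit Euler step
        history.append(n)
    return history


# Hypothetical numbers in arbitrary units.
history = vapour_density_history(escape_rate=2.0, return_coeff=0.5,
                                 dt=0.01, steps=5000)
n_saturated = 2.0 / 0.5  # density at which the two streams balance
print(history[-1], n_saturated)
```

The density grows monotonically from zero and levels off at the value
for which the stream leaving the liquid equals the stream returning to
it, which is the saturated state in this toy model.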

The number of molecules leaving a liquid in unit time grows very
rapidly with the temperature. The number of molecules colliding
with the surface of the liquid depends on the temperature to a smaller
extent (through $\langle v \rangle$, in proportion to $\sqrt{T}$). Therefore, upon
elevation of the temperature, equilibrium between the phases is
violated, and during a certain time the stream of molecules travelling
in the direction liquid → vapour will exceed their stream in the
direction vapour → liquid. This continues until the increase in the
pressure again leads to the setting in of mobile equilibrium. Thus,
the pressure at which mobile equilibrium sets in between a liquid and
its vapour, i.e. the saturated vapour pressure, is found to depend on
the temperature. The form of this relation is shown in Fig. 15.2. The
meaning of the symbols $T_{cr}$ and $p_{cr}$ will come to light in
Sec. 15.4.

If we increase the volume of the vessel, 
the vapour pressure will drop, and equi- 
librium will be violated. As a result, an 
additional amount of the liquid will transform into a vapour to 
make the pressure equal to $p_{s.v}$ again. Similarly, reduction of the
volume will cause a certain amount of the vapour to transform 
into a liquid. 
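
The mass of liquid that must evaporate to restore the saturated vapour
pressure after the volume is increased can be estimated with a short
sketch. It assumes (this is not stated in the text) that the saturated
vapour obeys the ideal gas equation and that the change in the volume
of the liquid is negligible. The saturated vapour pressure of water at
20 °C, about 2.34 kPa, is a handbook value and not one taken from this
book; the function name is hypothetical.

```python
R = 8.314462618  # molar gas constant, J/(mol*K)


def extra_evaporated_mass(p_sv, molar_mass, temperature, delta_volume):
    """Mass (kg) of vapour needed to fill delta_volume (m^3) at pressure p_sv (Pa).

    Ideal-gas estimate (an assumption): m = p_sv * M * dV / (R * T).
    """
    return p_sv * molar_mass * delta_volume / (R * temperature)


# Water at 20 °C; the volume of the vessel is increased by one litre.
dm = extra_evaporated_mass(p_sv=2339.0, molar_mass=0.018015,
                           temperature=293.15, delta_volume=1.0e-3)
print(f"extra mass evaporated: {dm * 1e6:.1f} mg")
```

Under these assumptions only about 17 mg of water has to evaporate per
additional litre, which shows how little liquid is needed to keep the
vapour saturated at room temperature.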

Everything said above about equilibrium between a liquid and 
a gas also holds for a solid-gas system. A definite value of the pres- 
sure at which mobile equilibrium sets in between the solid and the 
gas corresponds to every temperature. For many bodies such as so- 
lid metals, this pressure at ordinary temperatures is so small that 
it cannot be detected by the most sensitive instruments. 


Fig. 15.2 (vertical axis: $p_{s.v}$)



15.3. Equilibrium Between a Liquid 
and Its Saturated Vapour 

Let us consider the compression of a substance at a constant
temperature. Assume that the substance is initially gaseous. First, the
pressure of the gas will grow with decreasing volume (Fig. 15.3).
When the volume $V_g$ is reached, the pressure stops changing, and
the substance stops being homogeneous: part of the gas condenses
into a liquid. The substance stratifies into two phases: a liquid and
a gaseous one. A further reduction of the volume is attended by more
and more of the substance passing over into the liquid phase, the
transition occurring at a constant pressure $p_{s.v}$ (the saturated
vapour pressure). After condensation of the substance terminates (this
occurs when the volume $V_{lq}$ is reached), a further reduction in the
volume begins to be attended by a rapid growth of the pressure.

In Fig. 15.3, $V_g$ is the volume occupied by the substance in the
gaseous state at the pressure $p_{s.v}$, and $V_{lq}$ is the volume of
the substance in the liquid state at the same pressure. At any
intermediate value of the volume V, part of the substance with the mass
$m_{lq}$ will be in the liquid state, and part with the mass $m_v$ in
the vapour state. Let us find the ratio $m_{lq}/m_v$.

We shall call the volume of a unit mass of a substance its specific
volume V'. Thus, if the mass of a substance is m, then the specific
volumes of its saturated vapour and liquid at the pressure $p_{s.v}$
will be

$$V'_g = \frac{V_g}{m}, \qquad V'_{lq} = \frac{V_{lq}}{m} \tag{15.2}$$

Fig. 15.3


In the state when the mass of the liquid phase is $m_{lq}$ and that of
the vapour is $m_v$, the liquid will occupy the volume $V'_{lq} m_{lq}$,
and the saturated vapour, the volume $V'_g m_v$. The sum of these two
volumes must equal the volume V:

$$V = V'_{lq} m_{lq} + V'_g m_v$$

Introducing into this equation expressions (15.2) for the specific
volumes and substituting the sum $m_{lq} + m_v$ for the mass m, we
get

$$V = V_{lq} \frac{m_{lq}}{m_{lq} + m_v} + V_g \frac{m_v}{m_{lq} + m_v}$$

Hence,

$$\frac{m_{lq}}{m_v} = \frac{V_g - V}{V - V_{lq}} \tag{15.3}$$



(see Fig. 15.3). Thus, the ratio of the masses of the liquid and the 
saturated vapour in a two-phase state equals the ratio of the lengths 
into which the point depicting the state divides the horizontal 
portion of the isotherm. 
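
Relation (15.3) can be turned into a small calculation. The sketch
below splits a given total mass between the two phases from the volume
of the system; the function name is hypothetical, and the specific
volumes of water and its saturated vapour at 100 °C are handbook values
used only as an illustration.

```python
def phase_masses(total_mass, volume, v_liquid, v_vapour):
    """Return (m_lq, m_v) for a two-phase state of the given total volume.

    v_liquid, v_vapour : specific volumes of the two phases at the
                         saturated vapour pressure, m^3/kg
    """
    V_lq = total_mass * v_liquid   # volume if everything were liquid
    V_g = total_mass * v_vapour    # volume if everything were vapour
    if not V_lq <= volume <= V_g:
        raise ValueError("volume lies outside the horizontal portion of the isotherm")
    # Eq. (15.3) combined with m_lq + m_v = total_mass
    m_lq = total_mass * (V_g - volume) / (V_g - V_lq)
    m_v = total_mass * (volume - V_lq) / (V_g - V_lq)
    return m_lq, m_v


# 1 kg of water at 100 °C occupying 0.5 m^3 (illustrative numbers).
m_lq, m_v = phase_masses(total_mass=1.0, volume=0.5,
                         v_liquid=1.043e-3, v_vapour=1.673)
print(m_lq, m_v)
```

The point depicting the state lies closer to the liquid end of the
horizontal portion in volume, yet most of the mass is still liquid,
because the specific volume of the vapour is so much larger.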

It must be noted that at temperatures far from the critical one 
(the critical temperature will be treated in the following section) 
the difference between the volumes of a liquid and its vapour is
much greater than that shown in Fig. 15.3. For example, the specific
volume of saturated water vapour at 100 °C is 1600 times the
specific volume of liquid water at the same temperature.

Thus, a horizontal portion of an isotherm corresponds to the 
states of equilibrium between a liquid and its saturated vapour in a 
p-V diagram. This result is common for all two-phase states — a 
horizontal portion corresponds to a two-phase system on an isotherm 
depicted in the variables p and V. The ends of this portion correspond
to the volumes $V_1$ and $V_2$ occupied by the substance in the
first and second phases. These phases may be a liquid and its saturated
vapour, or a liquid and crystals (see Fig. 15.14), or, finally,
two crystalline modifications of the same substance. In all cases,
an equation similar to (15.3) holds:

$$\frac{m_1}{m_2} = \frac{V_2 - V}{V - V_1}$$

($m_1$ and $m_2$ are the masses of the substance in the first and second
phases).


15.4. The Critical State 

Figure 15.4 gives isotherms for several values of the temperature. 
A glance at the figure shows that the horizontal portion of the iso- 
therm diminishes in length with elevation of the temperature, and 
contracts into a point at the temperature $T_{cr}$ called the critical one.
The difference between the specific volumes diminishes accordingly,
and together with it the difference between the densities of the liquid
and its saturated vapour. This difference vanishes completely at the
critical temperature. Simultaneously, any difference between a liquid
and its vapour vanishes. The temperature dependence of the
density of a liquid and its saturated vapour is shown in Fig. 15.5.

Point C is the limit which the horizontal portions of the isotherms
tend to when the temperature tends to its critical value $T_{cr}$. It is
called the critical point. The state depicted by point C is defined
as the critical state of a substance. The volume $V_{cr}$, pressure
$p_{cr}$, and temperature $T_{cr}$ corresponding to the critical state
are called critical quantities. Point C is a point of inflection for
the critical isotherm. A tangent to the isotherm at point C is parallel
to the V-axis.

It can be seen from Fig. 15.4 that the saturated vapour pressure
grows with the temperature and reaches the value $p_{cr}$ at the critical
temperature. At temperatures above the critical one, the concept
of a saturated vapour loses its meaning. Therefore, the curve showing
the temperature dependence of the saturated vapour pressure
terminates at the critical point (see Fig. 15.2).

If we draw a line through the extreme points of the horizontal 
portions of the isotherms (Fig. 15.4), we get a bell-shaped curve 
confining the region of two-phase states of a substance. At temperatures
above the critical one, a substance is homogeneous at any pressure.
At such temperatures, a substance cannot be liquefied, no matter 
what pressure is applied to it. 

The concept of the critical temperature was first introduced in 
1860 by the Russian scientist Dmitri Mendeleev (1834-1907). He 
called it the temperature of absolute boiling of a liquid and consid- 
ered it as the temperature at which the forces of cohesion between the 
molecules vanish, and a liquid transforms into a vapour regardless 
of its pressure and the volume it occupies. 

The bell-shaped curve and the portion of the critical isotherm to 
the left of point C divide the p-V diagram into three regions 
(Fig. 15.6). The diagonal hatching shows the region of homogeneous 
liquid states of a substance. Under the bell-shaped curve is the re- 
gion of two-phase states, and, finally, the region to the right of the 
bell-shaped curve and the upper branch of the critical isotherm is
the region of homogeneous gaseous states of a substance. In the latter
region, we can single out the part under the right-hand branch
of the critical isotherm and call it the vapour region. Any state in
this region differs from the other gaseous states in that upon iso- 
thermal compression the substance which was originally in this state 
is liquefied. The substance in one of the states at a temperature 
above the critical one cannot be liquefied, no matter what pressure 
is applied. It is not customary practice to divide the gaseous states 
into a gas and a vapour. 

Having selected a transition process so that it does not intersect 
a two-phase region (Fig. 15.7), we can ensure a transition from the 
liquid state to the gaseous one (or vice versa) without separation 
of the substance into two phases. In this case, the substance will 
remain homogeneous all the time in the course of the transition 
process. 

15.5. Supersaturated Vapour 
and Superheated Liquid 

Section 10.13 gives Eq. (10.62) proposed by van der Waals to 
describe the state of gases at high densities. Figure 15.8 depicts van 
der Waals isotherms, i.e. curves described by Eq. (10.62) for several
temperatures. A characteristic of these isotherms is the fact that
at temperatures not exceeding the critical value $T_{cr}$ the curves
have an S-shaped bend in whose region three different values of the
volume correspond to a given pressure. Real isotherms (see Fig. 15.4) do
not have such a bend, but have a straight horizontal portion instead
of it. In Fig. 15.9, a real isotherm and a van der Waals isotherm are
superposed on one another. The van der Waals equation describes
the path of the isotherm quite well at volumes exceeding $V_g$. At
volumes smaller than $V_{lq}$, the path of a real isotherm also
approximately follows the van der Waals equation. Thus, this equation
covers not only the gaseous, but also the liquid state of a substance.

It can be seen from a comparison of a van der Waals isotherm 
with a real one that these isotherms approximately coincide on por- 
tions corresponding to one-phase states of a substance, but behave 
absolutely differently in the region of separation into two phases. 
Instead of the S-shaped bend on the van der Waals isotherm, the real
isotherm has a straight horizontal portion in this region which is
located so that the hatched areas 1 and 2 enclosed by the bend
(Fig. 15.9) are the same.
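
The equal-area condition can be used to locate the horizontal portion
numerically. The sketch below is only an illustration under stated
assumptions: it uses the van der Waals equation written in reduced
variables (volume, temperature and pressure in units of their critical
values), p = 8t/(3v - 1) - 3/v^2, which is a standard rewriting and not
the form of Eq. (10.62) itself, and a plain bisection search. All
function names are hypothetical.

```python
import math


def p_vdw(v, t):
    """Reduced van der Waals pressure (v, t, p in units of V_cr, T_cr, p_cr)."""
    return 8.0 * t / (3.0 * v - 1.0) - 3.0 / v ** 2


def dp_dv(v, t):
    """Slope of the reduced isotherm."""
    return -24.0 * t / (3.0 * v - 1.0) ** 2 + 6.0 / v ** 3


def bisect(f, lo, hi, iters=200):
    """Root of f on [lo, hi], assuming f changes sign exactly once there."""
    f_lo = f(lo)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0.0) == (f_lo > 0.0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def maxwell_pressure(t):
    """Return (p, v_liquid, v_gas) of the horizontal portion for 0 < t < 1."""
    v_lo = 1.0 / 3.0 + 1e-9
    v_min = bisect(lambda v: dp_dv(v, t), v_lo, 1.0)     # bottom of the bend
    v_max = bisect(lambda v: dp_dv(v, t), 1.0, 1000.0)   # top of the bend

    def roots(p):
        # Outer intersections of the line p = const with the isotherm.
        v_l = bisect(lambda v: p_vdw(v, t) - p, v_lo, v_min)
        v_g = bisect(lambda v: p_vdw(v, t) - p, v_max,
                     (8.0 * t / p + 1.0) / 3.0 + 1.0)
        return v_l, v_g

    def area_mismatch(p):
        # Integral of p_vdw dv between the roots minus the rectangle p * (v_g - v_l);
        # it vanishes when the two areas enclosed by the bend are equal.
        v_l, v_g = roots(p)
        integral = ((8.0 * t / 3.0)
                    * math.log((3.0 * v_g - 1.0) / (3.0 * v_l - 1.0))
                    + 3.0 / v_g - 3.0 / v_l)
        return integral - p * (v_g - v_l)

    p = bisect(area_mismatch, max(p_vdw(v_min, t), 1e-9), p_vdw(v_max, t))
    return (p,) + roots(p)


p_sat, v_l, v_g = maxwell_pressure(0.9)
print(p_sat, v_l, v_g)
```

For t = 0.9 the search gives a pressure lying between the minimum and
the maximum of the bend, with the liquid and vapour volumes on either
side of the critical volume. The sketch has only been thought through
for moderately subcritical temperatures; very close to t = 1 or at very
low t the brackets would need more care.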

Separation into two phases is explained by the lack of stability of
the homogeneous states corresponding to bend 1-2-3-4 (Fig. 15.10).
The instability of the states between points 2 and 3 becomes obvious
if we take into account that the derivative dp/dV is positive on this
part of the bend. Hence, a substance capable of passing consecutively
through states 2-3 would have absolutely unnatural properties: an
increase in the volume of the gas would be attended by a growth in
pressure instead of a reduction in it.

Fig. 15.10
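
The sign of dp/dV along an isotherm is easy to check numerically. The
following sketch again assumes the van der Waals equation in reduced
variables, p = 8t/(3v - 1) - 3/v^2 (an assumption made for convenience,
with hypothetical function names), and evaluates the slope at three
sample volumes of a subcritical isotherm.

```python
def dp_dv(v, t):
    """Slope of the reduced van der Waals isotherm p = 8t/(3v - 1) - 3/v**2."""
    return -24.0 * t / (3.0 * v - 1.0) ** 2 + 6.0 / v ** 3


def is_mechanically_stable(v, t):
    """True where compression raises the pressure, i.e. where dp/dV < 0."""
    return dp_dv(v, t) < 0.0


t = 0.9  # temperature in units of T_cr, below the critical one
for v in (0.6, 1.0, 3.0):  # liquid-like, middle of the bend, vapour-like
    print(v, dp_dv(v, t), is_mechanically_stable(v, t))
```

Only the middle sample, which lies on the rising part of the bend, has
a positive slope; above the critical temperature the same test returns
a negative slope at every volume.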

The derivative dp/dV is negative on parts 1-2 and 3-4, so that it
would seem possible for these portions of the curve to be realized.
Indeed, in certain conditions, the states corresponding to these
portions can be achieved. True, they are not fully stable: for example,
it is sufficient for a dust particle to get into the vapour in state
A for the substance to break up into two phases and pass over into
state B (see the transition A → B shown by the arrow in Fig. 15.10).
Such not fully stable states are called metastable. The substance
in states 1-2 is called a superheated liquid, and in states 3-4 is
called a supersaturated vapour.

At sufficiently low temperatures, the bottom part of the bend in 
the van der Waals isotherm crosses the V-axis and passes into the 
region of negative pressures (see the bottom isotherm in Fig. 15.10). 
A substance under a negative pressure is obviously in a state of 
tension instead of compression. Such states can also be realized in 
certain conditions. Thus, portion 5-6 on the bottom isotherm cor- 
responds to a superheated liquid, and 6-7 to a tensioned liquid. 
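
How low the temperature must be for the bend to reach negative
pressures can be explored with a short sketch. It assumes the reduced
form of the van der Waals equation, p = 8t/(3v - 1) - 3/v^2 (not the
form used in Eq. (10.62)), and finds the bottom of the bend by
bisection on the slope; the function names are hypothetical. For this
reduced equation the minimum pressure changes sign at t = 27/32, which
the code checks numerically.

```python
def p_vdw(v, t):
    """Reduced van der Waals pressure."""
    return 8.0 * t / (3.0 * v - 1.0) - 3.0 / v ** 2


def dp_dv(v, t):
    """Slope of the reduced isotherm."""
    return -24.0 * t / (3.0 * v - 1.0) ** 2 + 6.0 / v ** 3


def lowest_pressure_on_bend(t, iters=200):
    """Pressure at the bottom of the S-shaped bend for 0 < t < 1."""
    lo, hi = 1.0 / 3.0 + 1e-9, 1.0  # slope is negative at lo, positive at hi
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if dp_dv(mid, t) < 0.0:
            lo = mid
        else:
            hi = mid
    return p_vdw(0.5 * (lo + hi), t)


for t in (0.9, 27.0 / 32.0, 0.8):
    print(t, lowest_pressure_on_bend(t))
```

At t = 0.9 the whole bend lies at positive pressures, while at t = 0.8
its bottom part is already below the V-axis, so tensioned states appear
on the isotherm.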

Let us consider the conditions in which metastable states can be 
brought about. We shall begin with a supersaturated vapour. If 
a vapour contains absolutely no foreign inclusions, its condensation
into a liquid cannot begin. For a droplet to form, a great number of
molecules must simultaneously approach one another to a distance
of the same order as the distances between the molecules in the
liquid, and this is highly improbable. For condensation to
commence, the presence of so-called condensation centres is needed, which
capture the molecules flying toward them and transfer them into 
the condensed phase. Dust particles, liquid droplets, and, especially, 
charged particles (ions) can be condensation centres. 

Thus, if a vapour is thoroughly purified of foreign inclusions and
ions, it can be at a pressure exceeding the saturated vapour pressure
$p_{s.v}$ at the given temperature. This state will be metastable: it is
sufficient for even one condensation centre to appear, and the state
of a supersaturated vapour will be destroyed, the substance passing
over into a two-phase state.

In practice, a supersaturated vapour can be obtained by subjecting
one that is not supersaturated to sharp expansion. The rapid
expansion occurs without heat exchange with the surroundings and
is attended by cooling of the vapour. The point depicting the state
of the vapour moves along an adiabat. The latter, as was shown in
Sec. 10.10, is steeper than an isotherm. Hence, the vapour can pass
over from stable state 1 corresponding to the temperature $T_1$
(Fig. 15.11) to metastable state 2 corresponding to the lower
temperature $T_2$. Such a process is used in a Wilson cloud chamber, a
device intended for observing the tracks of charged particles (for
example, alpha particles). The air saturated with water or alcohol
vapour contained in a Wilson chamber is sharply expanded. The result
is cooling of the air, and the vapour becomes supersaturated. A particle
flying into the chamber ionizes the molecules along its path.
The supersaturated vapour condenses into minute droplets on the ions
produced and forms a clearly visible track.
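
The cooling produced by the sharp expansion can be estimated with a
minimal sketch. It assumes that the mixture in the chamber behaves as
an ideal gas with the adiabatic exponent of air, 1.4, and follows the
ideal-gas adiabat relation T V^(gamma - 1) = const; the expansion ratio
of 1.25 and the function name are hypothetical choices made for
illustration.

```python
def temperature_after_expansion(t_initial, expansion_ratio, gamma=1.4):
    """Temperature (K) after an adiabatic expansion from V1 to V2 = ratio * V1.

    Uses T * V**(gamma - 1) = const for an ideal gas (an assumption here).
    """
    return t_initial * expansion_ratio ** (1.0 - gamma)


t1 = 293.15  # initial temperature, K (20 °C)
t2 = temperature_after_expansion(t1, expansion_ratio=1.25)
print(f"temperature drops from {t1:.2f} K to {t2:.2f} K")
```

Under these assumptions a 25 per cent increase in volume lowers the
temperature by roughly 25 K, which is ample to carry a saturated vapour
well into the supersaturated region.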

Let us consider the conditions for obtaining a superheated liquid.
The process of violent vaporization (i.e. boiling) can occur, like
the process of condensation, on foreign inclusions, for example, on
sand particles or bubbles of gas dissolved in the liquid. If a liquid is
thoroughly purified of solid inclusions and dissolved gases, then by
heating it can be brought into a state with a pressure below $p_{s.v}$
at a given temperature without the liquid boiling. This will be the
state of a superheated liquid.

The transition of a liquid from its conventional state to a super- 
heated one is shown in Fig. 15.12 (see transition 1-2 shown by the 
arrow). The state of a superheated liquid is metastable. It is suffi- 
cient to throw a sand particle into a superheated liquid for the latter 
to boil and the substance to pass over into the stable two-phase 
state (see transition C-D in Fig. 15.10). 

A tensioned liquid, for example, mercury, can be obtained as 
follows. If we submerge a long glass tube sealed at one end into
mercury and, after turning it with its sealed end upward, carefully
pull it out from the mercury, then we can get a column of mer-
cury in the tube that considerably exceeds 760 mm. Hence, the 
mercury will be kept in the tube not by the force of atmospheric 
pressure, but by the cohesion between its molecules. The mercury in 
the tube will be in a state of tension, i.e. under a negative pressure. 
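
The pressure at the top of such a column follows from simple
hydrostatics. The sketch below assumes standard atmospheric pressure
outside, the handbook density of mercury at 0 °C, and a hypothetical
column height of 1 m; the function name is likewise an illustration.

```python
RHO_MERCURY = 13595.1  # density of mercury at 0 °C, kg/m^3
G = 9.80665            # standard acceleration of gravity, m/s^2
P_ATM = 101325.0       # standard atmospheric pressure, Pa


def pressure_at_top(column_height):
    """Pressure (Pa) in the mercury at the sealed end of the tube."""
    return P_ATM - RHO_MERCURY * G * column_height


for h in (0.5, 0.76, 1.0):  # column heights in metres
    print(h, pressure_at_top(h))
```

A column of 760 mm brings the pressure at the top practically to zero,
and any taller column puts the mercury under a negative pressure, i.e.
in a state of tension.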


15.6. Melting and Crystallization 

The transition of a crystalline body to the liquid state takes place
at a definite temperature for every substance and requires the
expenditure of a certain amount of heat called the heat of fusion.

If a substance originally in the crystalline state receives the
same amount of heat every second, its temperature will change with
time as shown in Fig. 15.13. First the temperature of the body will
constantly grow. When the melting point $T_m$ is reached (point 1 in
Fig. 15.13), the temperature of the body will stop changing although
the supply of heat to it is continued. At the same time, the process
of melting of the solid body begins, during which more and more of
the substance transforms into a liquid. After the melting process is
completed and all of the substance melts (point 2 in Fig. 15.13), the
temperature again begins to rise.
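
The heating curve of a crystalline body described above can be written
as a piecewise function of time. The sketch below assumes constant
specific heats of the solid and the liquid and no heat losses; the
function name and the numbers (rounded handbook values for ice and
water, a 1 kW heater) are illustrative assumptions.

```python
def temperature_at(time, power, mass, c_solid, c_liquid, heat_of_fusion,
                   t_start, t_melt):
    """Temperature (K) after heating for `time` seconds at constant `power` (W).

    The body starts as a crystal at t_start < t_melt; heat losses are ignored.
    """
    heat = power * time
    heat_to_melting_point = mass * c_solid * (t_melt - t_start)
    heat_to_melt = mass * heat_of_fusion
    if heat <= heat_to_melting_point:                  # crystal is warming up
        return t_start + heat / (mass * c_solid)
    if heat <= heat_to_melting_point + heat_to_melt:   # plateau 1-2: melting
        return t_melt
    surplus = heat - heat_to_melting_point - heat_to_melt
    return t_melt + surplus / (mass * c_liquid)        # liquid is warming up


# 1 kg of ice starting at -20 °C, heated at 1 kW (illustrative values).
ice = dict(power=1000.0, mass=1.0, c_solid=2100.0, c_liquid=4186.0,
           heat_of_fusion=3.34e5, t_start=253.15, t_melt=273.15)
for t in (0.0, 20.0, 100.0, 300.0, 400.0):
    print(t, temperature_at(t, **ice))
```

The printed temperatures rise, stay at the melting point while the heat
of fusion is being supplied, and then rise again, reproducing the shape
of the solid curve between and beyond points 1 and 2.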

The heating curve of an amorphous body is different (see the dashed
curve in Fig. 15.13). Upon the uniform supply of heat, the temperature
of an amorphous body continuously grows. Amorphous bodies
have no definite temperature of transition to the liquid state. This 
transition occurs continuously, and not in a jump. We can only 
indicate the temperature interval within which a body softens. The 
explanation is that liquids and amorphous bodies differ only in the 
degree of mobility of their molecules — amorphous bodies, as al- 
ready indicated, are greatly supercooled liquids. 

The melting point depends on the pressure. Thus, the transition
from a crystalline to a liquid state occurs in quite definite conditions
characterized by the values of the pressure and temperature. A curve
in a p-T diagram called the melting or fusion curve corresponds to a
combination of these values. The melting curve is very steep. To
change the melting point of ice by one kelvin, for example, the
pressure has to be changed by 132 at.

A point on the melting curve determines the conditions in which the
crystalline and the liquid phases can be in equilibrium with each
other. Such equilibrium is possible with any ratio between the masses
of the liquid and the crystals, i.e. at values of the volume of the
system within the limits from $m V'_s$ to $m V'_{lq}$, where m is the
mass of the system, and $V'_s$ and $V'_{lq}$ are the specific volumes of
the solid and the liquid phases. Consequently, in a p-V diagram, a
portion of a horizontal straight line (Fig. 15.14) corresponds to every
point of a melting curve. Since the substance in the states depicted
by points on this line has the same temperature, straight line 1-2
in Fig. 15.14 is a portion of an isotherm corresponding to two-phase
states of the substance (compare with the horizontal portions of
the isotherms in Fig. 15.4).

The process of crystallization that is the reverse of melting pro- 
ceeds as follows. When a liquid is cooled to a temperature at which 
the solid and liquid phases can be in equilibrium at the given pres- 
sure (i.e. to the same temperature at which melting occurs), the 
simultaneous growth of minute crystals begins about the so-called 
nuclei or centres of crystallization. Growing larger and larger, the 
separate crystals in the long run join one another, forming a poly- 
crystalline solid. 

Solid particles suspended in the liquid can be the crystallization
centres. A liquid thoroughly purified of such particles can be cooled
to below the freezing point without the formation of crystals
beginning. The state of such a supercooled liquid is metastable. It is
usually sufficient for a dust particle to get into such a liquid for it
to break up into a liquid and crystals at the equilibrium temperature.
Sometimes upon great supercooling, however, the mobility of the
liquid molecules is so insignificant that the metastable state can be
preserved for a very long time. The liquid in such cases has a very
low fluidity and is an amorphous solid.

The process of crystallization is attended by the liberation of the 
same amount of heat as that absorbed in melting. 


15.7. The Clapeyron-Clausius Equation 

We saw in the preceding sections that any two phases of a substance
can be in equilibrium only at a definite pressure whose magnitude
depends on the temperature. We can obtain the general form
of this relation by resorting to the concept of entropy. For this
purpose, we shall consider a Carnot cycle for a system consisting of
two phases of a given substance in equilibrium.

In a p-V diagram, the Carnot cycle for a two-phase system has the
form shown in Fig. 15.15 (the temperatures of the high temperature and
low temperature reservoirs are assumed to differ by the very small
value $\Delta T$). The numbers 1 and 2 denote the extreme points of the
horizontal portion of the isotherm of temperature T. States 1 and 2 are
one-phase ones. All the intermediate points on 1-2 depict two-phase
states differing from each other in the distribution of the mass
of the substance between the first and the second phases.

The isothermal process A → B is attended by a phase transition
of a certain mass m of the substance. The volume of the substance
receives an increment of $m (V'_2 - V'_1)$, where $V'_1$ and $V'_2$ are
the specific volumes of the first and second phases. For such a
transition to occur, the heat $Q_1$ equal to $m L_{12}$ must be supplied
to the substance ($L_{12}$ is the specific heat absorbed in the
transition from state 1 to state 2 at the temperature T). The heat
$Q_1$ is the heat which the system receives in the course of the cycle
from the high temperature reservoir. The heat is given up to the low
temperature reservoir in the course of the isothermal process C → D.
The heat given up amounts to $Q'_2 = m' L'_{12}$, where $L'_{12}$ is the
heat of transition 1-2 at the temperature $T - \Delta T$, and m' is the
mass of the substance experiencing a phase transition in the course of
the process C → D. This value differs somewhat from m because a certain
mass of the substance undergoes phase transitions in the course of
adiabatic processes.

On the isothermal portion A-B, the entropy of the system receives
the increment $\Delta S_1$ equal to $Q_1/T$. On the isothermal portion
C-D, the increment of the entropy is $\Delta S_2 = -Q'_2/(T - \Delta T)$.
The entropy does not change in the course of the adiabatic processes
B-C and D-A. The total increment of the entropy during a cycle is zero.
Hence,

$$\Delta S_1 + \Delta S_2 = \frac{Q_1}{T} - \frac{Q'_2}{T - \Delta T} = 0$$

whence

$$(Q_1 - Q'_2) T = Q_1 \Delta T \tag{15.4}$$

According to Eq. (12.3), $Q_1 - Q'_2$ equals the work done during
a cycle. This work can be found by calculating the area of the cycle.
Approximately, this area can be considered equal to
$m (V'_2 - V'_1) \Delta p$ (see Fig. 15.15). We thus arrive at the relation

$$Q_1 - Q'_2 \approx m (V'_2 - V'_1) \Delta p \tag{15.5}$$

In the limit when $\Delta p$ tends to zero (for which it is necessary that
$\Delta T$ also tend to zero), expression (15.5) transforms into a strict
equality.

Let us substitute expression (15.5) in Eq. (15.4) for $Q_1 - Q'_2$,
and also $m L_{12}$ for $Q_1$. As a result, we get

$$m (V'_2 - V'_1) T \Delta p \approx m L_{12} \Delta T$$

Hence,

$$\frac{\Delta p}{\Delta T} \approx \frac{L_{12}}{T (V'_2 - V'_1)}$$

Finally, performing the limit transition $\Delta T \to 0$, we arrive at the
strict equation

$$\frac{dp}{dT} = \frac{L_{12}}{T (V'_2 - V'_1)} \tag{15.6}$$


This expression is called the Clapeyron-Clausius equation. It re- 
lates the temperature derivative of the equilibrium pressure to the 
heat of transition, the temperature, and the difference between the 
specific volumes of the phases in equilibrium. 
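
Equation (15.6) lends itself to a direct numerical estimate. The sketch
below evaluates the slope of the vaporization curve of water at its
normal boiling point; the function name is hypothetical, and the
specific heat of vaporization and the specific volumes are rounded
handbook values, not figures taken from this book.

```python
def equilibrium_slope(latent_heat, temperature, v_first, v_second):
    """Slope dp/dT of the equilibrium curve from Eq. (15.6), in Pa/K.

    latent_heat       : specific heat L12 of the transition 1 -> 2, J/kg
    temperature       : equilibrium temperature, K
    v_first, v_second : specific volumes of phases 1 and 2, m^3/kg
    """
    return latent_heat / (temperature * (v_second - v_first))


# Water at 100 °C: phase 1 is the liquid, phase 2 the saturated vapour.
slope = equilibrium_slope(latent_heat=2.257e6, temperature=373.15,
                          v_first=1.043e-3, v_second=1.673)
print(f"dp/dT = {slope:.0f} Pa/K")
```

With these inputs the saturated vapour pressure of water near 100 °C
grows by roughly 3.6 kPa, or about 27 mmHg, per kelvin.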

According to Eq. (15.6), the sign of the derivative dp/dT depends
on what change in the volume — an increase or a reduction — attends 
a phase transition occurring with the absorption of heat. In the 
evaporation of a liquid or a solid, the volume always grows, there- 
fore dp/dT for a vaporization curve, and also for a sublimation one, 
can only be positive: elevation of the temperature leads to an in- 
crease in the equilibrium pressure. 

The volume grows, as a rule, in melting, so that dp/dT > 0: an
increase in the pressure raises the melting point. For some substances
including water, however, the volume of the liquid phase is less
than that of the solid phase ($V'_2 < V'_1$)*. In this case, dp/dT < 0:
an increase in the pressure is attended by lowering of the melting
point. We can melt ice by applying a high pressure to it without
raising its temperature above 0 °C.
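
The lowering of the melting point of ice under pressure can be
estimated from Eq. (15.6) by treating the melting curve as a straight
line over a small interval. This linearization, the function name, the
applied pressure of 100 atm, and the rounded handbook values for ice
and water at 0 °C are all assumptions made for illustration.

```python
ATM = 101325.0  # standard atmosphere, Pa


def melting_point_shift(delta_p, latent_heat, t_melt, v_solid, v_liquid):
    """Change of the melting point (K) caused by a pressure change delta_p (Pa).

    Linear estimate: delta_T = delta_p / (dp/dT), with dp/dT from Eq. (15.6).
    """
    slope = latent_heat / (t_melt * (v_liquid - v_solid))  # Pa/K
    return delta_p / slope


shift = melting_point_shift(delta_p=100.0 * ATM, latent_heat=3.34e5,
                            t_melt=273.15, v_solid=1.091e-3, v_liquid=1.000e-3)
print(f"melting point shift under 100 atm: {shift:.2f} K")
```

The shift is negative, as expected for a substance whose liquid is
denser than its crystals, and amounts to less than one kelvin even at
100 atm, in keeping with the steepness of the melting curve noted in
Sec. 15.6.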

The temperature of transition from one crystalline modification 
to another will rise or lower with increasing pressure depending on 
which of the solid phases has a greater specific volume. 


15.8. Triple Point. Phase Diagram 

Let us take a substance in the form of a liquid and its saturated
vapour in equilibrium with it and withdraw heat from it without
changing the volume. This process will be attended by lowering of
the temperature of the substance and a corresponding reduction in
the pressure. Therefore, the point depicting the state of the substance
in a p-T diagram will move downward along the vaporization curve
(Fig. 15.16). This will continue until the freezing point of the
substance is reached corresponding to the equilibrium pressure value.
Let us denote this temperature by $T_{tr}$. The temperature and the
pressure remain constant all the time the freezing process goes on.
The heat removed during this process is the heat liberated in
freezing (crystallization).

The temperature $T_{tr}$ and the equilibrium pressure $p_{tr}$
corresponding to it are the only values of the temperature and pressure at
which three phases of a substance — the solid, liquid, and gaseous 
ones — can be in equilibrium. The corresponding point in a p-T 
diagram is called a triple point. Thus, a triple point determines the 
conditions in which three phases of a substance can be in equilibrium 
simultaneously. 

Upon completion of the freezing process, the solid and gaseous 
phases will be in equilibrium. If we continue to remove heat from 
the substance, the temperature will again begin to lower. The pres- 
sure of the vapour in equilibrium with the crystalline phase will 
decrease accordingly. The point depicting the state of the substance 
will move downward along the sublimation curve. 


* The volume of water is known to increase when it freezes. For this reason,
ice has a smaller density than water.

The temperature of a triple point is the temperature at which
a substance melts when it is under the pressure $p_{tr}$. At other pressures,
the melting point will be different. The relation between the 
pressure and the melting point will be depicted by the melting curve 
beginning at the triple point. Thus, a triple point is on the inter- 
section of three curves determining the conditions of equilibrium 
of two phases: solid and liquid, liquid and gaseous, and, finally, 
solid and gaseous. 

Depending on the relation between the specific volumes of the 
solid and liquid phases, the melting curve is directed either as shown 
in Fig. 15.16 (dp/dT > 0) or as shown in Fig. 15.17 (dp/dT < 0).


Fig. 15.17

Fig. 15.18


The melting, vaporization, and sublimation curves divide the 
coordinate plane into three regions. To the left of the sublimation 
and melting curves is the region of the solid phase, between the mel- 
ting and vaporization curves is the region of liquid states, and, 
finally, to the right of the vaporization and sublimation curves is 
the region of gaseous states of the relevant substance. Any point in 
one of these regions depicts the corresponding one-phase state of 
the substance (we always have in view only equilibrium states, i.e.
states in which a substance can be as long as desired in unchanging 
external conditions). A point on one of the curves separating the re- 
gions depicts a state of equilibrium of the two relevant phases of the 
substance. The triple point depicts the state of equilibrium of all 
three phases. Thus, each point in the diagram depicts a definite 
equilibrium state of the substance. Such a diagram is called a phase 
diagram. 

The phase diagram is more complicated for a substance having 
several crystalline modifications. Figure 15.18 shows a diagram for 
the case when the number of different crystalline modifications is 
two. There are two triple points in this case. The liquid, gas, and 
the first crystalline modification of the substance are in equilibrium 
at point Tr, while the liquid and both crystalline modifications are
in equilibrium at point Tr'.

A phase diagram for each particular substance is plotted on the 
basis of experimental data. Knowing the phase diagram, we can 
predict the state in which a substance will be in various conditions 
(at various values of p and T), and also the transformations which
the substance will undergo in different processes. 

The following examples will explain this. If we take a substance 
in the state corresponding to point 1 (see Fig. 15.16) and subject it 
to isobaric heating, then the substance will pass through the sequence 
of states shown by the dashed straight line 1-2, namely,
crystals-liquid-gas. If we take the same substance in the state depicted by
point 3 and also subject it to isobaric heating, then the sequence of
states (dashed line 3-4) will be different, namely, the crystals transform
directly into a gas without passing through the liquid phase. 

It can be seen from the phase diagram that the liquid phase can 
exist in an equilibrium state only at pressures not lower than that 
of the triple point (the same relates to the solid phase II in 
Fig. 15.18). At pressures below $p_{tr}$, only supercooled liquids are
observed. 

The triple-point pressure of most ordinary substances is considerably
lower than atmospheric pressure. Hence, the transition of these
substances from the solid state to the gaseous one occurs through the
intermediate liquid phase. For instance, a pressure of 4.58 mmHg and a
temperature of 0.01 °C correspond to the triple point of water.
For carbon dioxide, the pressure at the triple point is 5.11 atm
(the temperature of the triple point is −56.6 °C). Therefore at
atmospheric pressure, carbon dioxide can exist only in the solid and the
gaseous states. Solid carbon dioxide (dry ice) transforms directly
into a gas. The sublimation point of carbon dioxide at atmospheric
pressure is −78 °C.
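
The rule that decides whether crystals melt or sublime on isobaric
heating can be put into a few lines. The sketch below compares the
applied pressure with the triple-point pressure, using the two values
quoted above; the function name is hypothetical, and the critical point
is deliberately ignored, so the sketch applies only below the critical
pressure.

```python
MMHG_PER_ATM = 760.0


def isobaric_heating_sequence(pressure, triple_point_pressure):
    """Phases passed through when crystals are heated at constant pressure.

    Both pressures must be in the same units. The critical point is ignored,
    so the sketch applies only below the critical pressure.
    """
    if pressure < triple_point_pressure:
        return ["crystals", "gas"]            # sublimation
    return ["crystals", "liquid", "gas"]      # melting, then vaporization


atmospheric = 760.0  # mmHg
water = isobaric_heating_sequence(atmospheric, triple_point_pressure=4.58)
carbon_dioxide = isobaric_heating_sequence(
    atmospheric, triple_point_pressure=5.11 * MMHG_PER_ATM)
print("water:", water)
print("carbon dioxide:", carbon_dioxide)
```

At atmospheric pressure the function returns the sequence with a liquid
stage for water and the direct crystals-gas sequence for carbon
dioxide, matching the behaviour of dry ice described above.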

If the specific volume of crystals exceeds the specific volume of 
the liquid phase, then the behaviour of the relevant substance in 
some processes may be quite peculiar. Let us take, for example, 
such a substance in the state depicted by point 1 (see Fig. 15.17), 
and subject it to isothermal compression. The pressure grows in 
this case, and the process is depicted in the diagram by a vertical 
straight line (see dash line 1-2). In the course of the process, the 
substance passes through the following sequence of states: gas- 
crystals-liquid. Such a sequence is evidently observed at tempera- 
tures below that of the triple point. 

In concluding, we shall note another feature of a phase diagram. 
The vaporization curve terminates at the critical point C. Hence, 
a transition from the region of liquid states to that of gaseous states 
is possible around the critical point without intersecting the vaporization 
curve (see transition 3-4 in Fig. 15.17 depicted by the dashed 
curve). Figure 15.11 shows how such a transition looks in a p-V 
diagram. In this case, the transition from the liquid state to the 
gaseous one (and vice versa) is performed continuously through a 
sequence of one-phase states. It must be noted that the entire hori- 
zontal portion of the relevant isotherm in Fig. 15.11 corresponds to 
the point with the coordinate T taken on the vaporization curve. 

The continuous transition between the liquid and the gaseous 
states is possible because the difference between them is more of a 
quantitative than of a qualitative nature. In particular, anisotropy 
is absent in both these states. A continuous transition from the crystalline 
state to the liquid or gaseous one is impossible because the 
crystalline state is characterized by anisotropy, and a transition 
from a state with anisotropy to one without it can occur 
only in a jump. Anisotropy cannot be present only partly: 
either it is present or it is absent, and there is no third possibility. 
This is why the sublimation and the melting curves cannot terminate 
at the critical point like the vaporization curve does. The sublima- 
tion curve terminates at the point p = 0 and T = 0, and the melting 
curve extends to infinity. 

In exactly the same way, a continuous transition from one cry- 
stalline modification to another is impossible. Different crystalline 
modifications of a substance differ in their inherent elements of 
symmetry. Since a symmetry element may only be either present 
or absent, the transition from one solid phase to another is possible 
only in a jump. For this reason, the equilibrium curve of two solid 
phases, like the melting curve, extends to infinity. 




CHAPTER 16 PHYSICAL KINETICS 


16.1. Transport Phenomena 

Statistical physics has to do with equilibrium states and reversible 
processes (i.e. processes in which a system passes through a sequence 
of equilibrium states). The science studying the processes that are 
set up when equilibrium is violated is called physical kinetics. 

When equilibrium of a system is violated, the system tends to 
return to its equilibrium state. This process is attended by a growth 
in entropy and is consequently irreversible. Thus, the processes 
which physical kinetics studies are irreversible. 

The violation of equilibrium is accompanied by the appearance 
of flows of either molecules, or heat, or an electric charge, and so 
on. In this connection, the relevant processes are called transport 
phenomena. It follows from the above that transport phenomena 
are irreversible processes. 

We shall consider three transport phenomena — diffusion, thermal 
conductivity, and internal friction or viscosity. We shall deal only 
with the cases when deviations from equilibrium are not consider- 
able. First, we shall write empirical equations of these processes appli- 
cable to any media (solid, liquid, and gaseous). In the following sec- 
tions, we shall give the molecular-kinetic derivation of these equa- 
tions for gases. 

When considering transport phenomena, we shall have to calcu- 
late the amounts of various quantities (the number of particles, 
mass, energy, momentum) transported through an imaginary surface. 
The amount of a quantity passing in unit time through a surface 
is called the flux (flow) of this quantity. Examples are the flux 
(flow) of a liquid through a cross section of a pipe or tube, and the 
light flux through a window pane or through the glass bulb of an 
electric lamp. We can consider the flux through a surface of any 
shape; in particular, the surface can be closed. 

The flux is a scalar algebraic quantity. The sign of a flux is deter- 
mined by the choice of the positive direction, for example, the direc- 
tion of the axis along which the flux propagates. The positive direc- 
tion is usually chosen arbitrarily. For closed surfaces, it is customa- 
ry practice to consider the flux flowing out of the surface as positive, 
and that flowing into it as negative. 

In this chapter, we shall deal with fluxes through flat surfaces 
perpendicular to the z-axis. If particles, energy, or momentum are 
transported through the surface in the direction of the z-axis, we 
shall consider the corresponding flux to be positive, otherwise we 
shall consider it negative. 

Every transport phenomenon is due to changes in a certain quantity 
$f$ in space. This quantity for the transport of particles (diffusion) 
is the concentration of the particles; the latter are transported in 
the direction of their diminishing concentration. A heat flux appears 
when the temperature at different points of the medium differs, 
the heat flowing in the direction of diminishing temperature, etc. 

We shall consider for simplicity that the quantity $f$ whose lack 
of homogeneity underlies the given transport process (the concentration, 
temperature, etc.) is a function of only the coordinate z. 
Hence, the change in this quantity in space will be characterized by 
the derivative $df/dz$. The latter is usually called the gradient of the 
quantity $f$. This name is not quite correct: strictly speaking, the 
derivative of the scalar function $f = f(z)$ with respect to z gives the 
projection of the gradient of the function onto the z-axis [see Eq. (3.23)]. 
Following the tradition, however, we shall call quantities of the kind 
$df/dz$ in a transport equation a gradient. 

Fig. 16.1 

Diffusion. Diffusion is defined as the spontaneous levelling out 
of the concentrations in a mixture of several (in the simplest case 
of two) different substances due to thermal motion. This process is 
observed in solid, liquid, and gaseous media. We shall consider only 
gaseous media. 

Assume that a unit volume of a two-component gas mixture contains 
$n_1$ molecules of one species and $n_2$ molecules of another one. 
The total number of molecules in unit volume is $n = n_1 + n_2$. The 
ratio 

$$c_i = \frac{n_i}{n}$$

is called the relative concentration of the molecules of the i-th species. 

Let us assume that concentration gradients $dc_1/dz$ and $dc_2/dz$ are 
set up in the direction of the z-axis, and $dc_1/dz = -dc_2/dz$ 
(Fig. 16.1). Hence 

$$\frac{d}{dz}(c_1 + c_2) = \frac{1}{n}\,\frac{d}{dz}(n_1 + n_2) = 0$$

so that n and, consequently, p are constant (p = nkT). Therefore, 
no gas-dynamical fluxes appear. But owing to the thermal motion 
of the molecules, the concentrations will level out, and this process 
is attended by the transport of the mass of each of the components 
in the direction of its diminishing concentration. As 
indicated above, this process is called diffusion. 

It has been established experimentally that the flux of molecules 
of the i-th species through surface S perpendicular to the z-axis is 
determined by the expression 

$$N_i = -D \frac{dn_i}{dz} S \tag{16.1}$$

where D is a constant of proportionality called the diffusion coeffi- 
cient. 

According to Eq. (16.1), when $dn_i/dz > 0$, the flux $N_i$ is negative; 
this signifies that the molecules are transported in a direction opposite 
to that of the z-axis (Fig. 16.2a). When $dn_i/dz < 0$, the flux is 
positive, i.e. the molecules are transported in the direction of the 
z-axis (Fig. 16.2b). Thus, the minus sign in Eq. (16.1) is due to the 
fact that the molecules flow in the direction of diminishing 
concentration. 

Fig. 16.2 

The dimension of the flux of molecules $N_i$ is 1/T, that of $n_i$ is 
1/L³, of the area S is L², and dz has the dimension L. Hence the 
diffusion coefficient has the dimension L²/T. 

Multiplying both sides of Eq. (16.1) by the mass of a molecule of 
the i-th species $m_i$, we get an expression for the flux of the mass of 
the i-th component: 

$$M_i = -D \frac{d\rho_i}{dz} S \tag{16.2}$$

Here $\rho_i = n_i m_i$ is the partial density of the i-th component; it is 
also called the absolute concentration. 

Equations (16.1) and (16.2) are empirical equations of diffusion. 
They are also called Fick’s law. 
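
Equations (16.1) and (16.2) can be tried out numerically with the following minimal Python sketch. The function names and the numerical values of D, of the gradient, of the area, and of the molecular mass are illustrative assumptions and are not taken from the text.

```python
def particle_flux(D, dn_dz, S):
    """Flux of molecules through area S, Eq. (16.1): N_i = -D * (dn_i/dz) * S."""
    return -D * dn_dz * S


def mass_flux(D, dn_dz, S, m):
    """Mass flux, Eq. (16.2), using rho_i = n_i * m_i so that d(rho_i)/dz = m * dn_i/dz."""
    return m * particle_flux(D, dn_dz, S)


# Assumed illustrative values (SI units).
D = 2e-5        # m^2/s
dn_dz = 1e24    # m^-4, concentration growing along z
S = 1e-4        # m^2
m = 5.3e-26     # kg, roughly the mass of an oxygen molecule

print("N_i =", particle_flux(D, dn_dz, S), "1/s")
print("M_i =", mass_flux(D, dn_dz, S, m), "kg/s")
```

With a positive gradient both fluxes come out negative, i.e. the transport is directed against the z-axis, in agreement with the sign convention adopted above.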

Thermal Conductivity. Experiments show that if we set up a 
temperature gradient along the z-axis in a medium (solid, liquid, 
or gaseous one), then a heat flux is produced whose magnitude is 
determined by the formula 

$$q = -\kappa \frac{dT}{dz} S \tag{16.3}$$

where q = heat flux through surface S perpendicular to the z-axis 
dT/dz = temperature gradient (more exactly, the projection of 
the temperature gradient on the z-axis) 
$\kappa$ = proportionality constant depending on the properties 
of the medium and called the thermal conductivity 
coefficient. 

The unit of q is J/s, i.e. W (watt). Hence, $\kappa$ is measured in watts 
per metre-kelvin [W/(m·K)]. The minus sign in Eq. (16.3) signifies 
that the heat flows in the direction of diminishing temperature. 
Therefore, the signs of q and dT/dz are opposite. 

Equation (16.3) is an empirical equation of thermal conduc- 
tivity. It is also called the Fourier law. 

Internal Friction. According to Eq. (9.12), the force of friction 
between two layers of a fluid is 

$$F = \eta \left| \frac{du}{dz} \right| S \tag{16.4}$$

where $\eta$ = viscosity (viscosity coefficient) 
du/dz = quantity showing how rapidly the velocity of the fluid 
changes in the direction z perpendicular to the direction 
of motion of the layers (the gradient of u) 
S = surface area over which the force F acts. 

Equation (16.4) is the empirical equation of viscosity. 
According to Newton’s second law, the interaction of two layers 
with the force F can be considered as a process in the course of which 
a momentum equal to F in magnitude is transmitted from one layer 
to another in unit time. Therefore, Eq. (16.4) can be written in the 
form 

$$K = -\eta \frac{du}{dz} S \tag{16.5}$$

where K is the momentum transmitted in one second from layer 
to layer through surface S, i.e. the momentum flux through S. 

The momentum flux K is measured in kg·m/s². Hence, the unit 
of the viscosity $\eta$ is the kilogramme per metre-second [kg/(m·s)]. 
[This unit can also be written in the form pascal-second (Pa·s).] 
The minus sign in Eq. (16.5) is due to the circumstance that the 
momentum “flows” in the direction of the decrease in the velocity 
u. Therefore, the signs of the flux K and of the derivative du/dz 
are opposite. 

It must be remembered that Eq. (16.4) determines the identical 
modulus of two oppositely directed forces with which the layers act 
on each other. Therefore, the minus sign must not be written in front 
of the right-hand side of Eq. (16.4). In addition, we must take the 
magnitude of the expression du/dz (the magnitude of the force with 
any sign of the derivative du/dz must be positive). 
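
The difference between the force modulus of Eq. (16.4) and the signed momentum flux of Eq. (16.5) can be seen in a minimal Python sketch. The function names and the numerical values (a viscosity of the order of that of air, an arbitrary area and velocity gradient) are assumptions made for illustration only.

```python
def friction_force(eta, du_dz, S):
    """Modulus of the internal friction force, Eq. (16.4): F = eta * |du/dz| * S."""
    return eta * abs(du_dz) * S


def momentum_flux(eta, du_dz, S):
    """Signed momentum flux, Eq. (16.5): K = -eta * (du/dz) * S."""
    return -eta * du_dz * S


eta = 1.8e-5   # Pa*s, assumed order of magnitude for air
S = 0.01       # m^2, assumed area

for du_dz in (50.0, -50.0):   # 1/s, velocity gradient of either sign
    F = friction_force(eta, du_dz, S)
    K = momentum_flux(eta, du_dz, S)
    print(f"du/dz = {du_dz:+.0f} 1/s: F = {F:.2e} N, K = {K:+.2e} kg*m/s^2")
```

The force is the same for both signs of the gradient, whereas the momentum flux changes sign and always has the sign opposite to that of du/dz.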


16.2. Mean Free Path 

The molecules of a gas in thermal motion continuously collide 
with one another. The term “collision” as applied to molecules must 
not be understood literally and the process conceived as the collision 
of rigid balls. By a collision of molecules is meant the process of 
interaction between them as a result of which the molecules change 
the direction of their motion. 

Figure 16.3 shows a curve depicting the mutual potential energy of two 
molecules as a function of the distance r between their centres. Let us 
use this curve to consider the process of the approach (collision) 
of molecules. Let us mentally place the centre of one of the molecules 
at the origin of coordinates, and imagine the centre of the 
second molecule moving along the r-axis. Assume that the second 
molecule is flying toward the first one from infinity having the 
initial store of kinetic energy $\varepsilon_k = \varepsilon_1$. Approaching the first 
molecule, the second one under the action of the force of attraction moves 
with a constantly growing velocity. The kinetic energy of the molecule 
$\varepsilon_k$ also grows as a result. The total energy of the system equal to 
$\varepsilon = \varepsilon_k + \varepsilon_p$, however, remains unchanged (the system of the two 
molecules is closed) and equal to $\varepsilon_1$ because the potential energy 
$\varepsilon_p$ diminishes simultaneously. When the molecule passes the point 
with the coordinate $r_0$, the forces of attraction are replaced by forces 
of repulsion, owing to which the molecule begins to rapidly lose its 
velocity (in the region of repulsion the curve of $\varepsilon_p$ is very steep). 
At the moment when the potential energy $\varepsilon_p$ becomes equal to the 
total energy of the system $\varepsilon_1$, the velocity of the molecule vanishes. 
At this moment, the molecules approach each other to the closest 
distance. After the molecule stops, all the phenomena proceed in 
the reverse sequence: first the molecule travels with a constantly 
growing velocity under the action of the force of repulsion; upon 
passing the distance $r_0$, the molecule gets under the action of the 
force of attraction that retards its motion and, finally, travels away 
to infinity having its initial store of kinetic energy $\varepsilon_k = \varepsilon_1$. 

The minimum distance between the centres of two molecules 
when they collide is called the effective or collision diameter of 
a molecule d (Fig. 16.4). The quantity 

$$\sigma = \pi d^2 \tag{16.6}$$

is known as the effective section of a molecule. 

A glance at Fig. 16.3 shows that when a molecule begins its motion 
from infinity with a greater store of energy, the minimum 
distance between the centres of the molecules when they approach 
is less (compare $d_1$ and $d_2$ in the figure). Thus, the collision diameter 
of molecules depends on their energy and, consequently, on the 
temperature. The collision diameter of molecules diminishes with 
increasing temperature. 
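
The dependence of the collision diameter on the energy can be illustrated with a model potential curve. The sketch below assumes a Lennard-Jones form for the potential energy; this particular form, the dimensionless units, and the function names are our assumptions (the text specifies only the qualitative shape of the curve in Fig. 16.3). The collision diameter is found as the distance on the repulsive branch at which the potential energy equals the initial kinetic energy.

```python
def model_potential(r, eps0=1.0, r0=1.0):
    """Assumed Lennard-Jones curve with its minimum -eps0 at r = r0."""
    x = (r0 / r) ** 6
    return eps0 * (x * x - 2.0 * x)


def collision_diameter(energy, eps0=1.0, r0=1.0):
    """Closest approach d: root of model_potential(d) = energy for r < r0.

    The potential falls monotonically on the repulsive branch, so
    bisection between a very small r and r0 is sufficient.
    Valid for 0 <= energy well below model_potential(0.1 * r0).
    """
    lo, hi = 0.1 * r0, r0
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if model_potential(mid, eps0, r0) > energy:
            lo = mid      # still too close: potential exceeds the energy
        else:
            hi = mid
    return 0.5 * (lo + hi)


for energy in (0.5, 1.0, 5.0, 10.0):   # in units of the well depth eps0
    print(f"energy = {energy:4.1f}  ->  d = {collision_diameter(energy):.4f} r0")
```

The printed values of d decrease as the energy grows, which is the behaviour described above; because the repulsive branch is steep, the decrease is slow.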

During one second, a molecule travels an average path equal to the 
mean velocity $\langle v \rangle$. If it experiences an average of $\nu$ collisions 
a second, then the mean free path will be 

$$l = \frac{\langle v \rangle}{\nu} \tag{16.7}$$

To calculate the average number of collisions $\nu$, let us first assume 
that all the molecules except for a given one are frozen still in 
their places. Let us watch the motion of the molecule we have earmarked. 
After striking one of the stationary molecules, it will fly in 
a straight line until it collides with another stationary molecule 
(Fig. 16.5). This collision will occur if the centre of the stationary 
molecule is at a distance from the line of flight of the molecule less 
than the collision diameter of a molecule d . As a result of the colli- 
sion, the molecule will change the direction of its motion, and will 
then again fly along a straight line for a time. This will continue 
until it encounters another molecule whose centre will be within 
the cylinder of radius d shown in Fig. 16.5. 

The molecule travels a path of $\langle v \rangle$ in one second. The number of 
collisions with stationary molecules occurring during this time equals 
the number of molecules whose centres are within the bent cylinder 
of length $\langle v \rangle$ and radius d. It will be shown below that the mean 
free path is much larger than the collision diameter of the molecules 
d. Therefore, the volume of the cylinder can be considered equal to 
$\pi d^2 \langle v \rangle$. Multiplying this volume by the number of molecules in a 
unit volume n, we get the average number of collisions of a moving 
molecule with stationary ones per second: 

$$\nu' = \pi d^2 \langle v \rangle n \tag{16.8}$$

Actually, all the molecules are in motion, owing to which the 
number of collisions is determined by the mean velocity of motion 
of the molecules with respect to one another, and not by the mean 
velocity $\langle v \rangle$ of molecules relative to the walls of the vessel confining 
them. The relative velocity of two arbitrarily taken molecules is 

$$\mathbf{v}_{rel} = \mathbf{v}_2 - \mathbf{v}_1$$

Squaring this equation, we get 

$$v_{rel}^2 = (\mathbf{v}_2 - \mathbf{v}_1)^2 = v_2^2 + v_1^2 - 2\,\mathbf{v}_1 \mathbf{v}_2$$

(we have taken advantage of the fact that $\mathbf{v}^2 = v^2$). The mean value 
of the sum of several quantities equals the sum of the mean values 
of the quantities being added. Hence, 

$$\langle v_{rel}^2 \rangle = \langle v_2^2 \rangle + \langle v_1^2 \rangle - 2 \langle \mathbf{v}_1 \mathbf{v}_2 \rangle$$

Events consisting in that a first molecule has the velocity $\mathbf{v}_1$ and 
a second one the velocity $\mathbf{v}_2$ are statistically independent. Therefore, 
$\langle \mathbf{v}_1 \mathbf{v}_2 \rangle = \langle \mathbf{v}_1 \rangle \langle \mathbf{v}_2 \rangle$. Each of the multipliers equals zero for a gas in 
equilibrium. Thus, 

$$\langle v_{rel}^2 \rangle = 2 \langle v^2 \rangle$$

(the mean value of the square of the velocity of all the molecules is 
the same and equals $\langle v^2 \rangle$). The result obtained signifies that 
$v_{rel,\,m.sq} = \sqrt{2}\, v_{m.sq}$. The mean square velocities are proportional to the 
arithmetical mean ones. Consequently, 

$$\langle v_{rel} \rangle = \sqrt{2}\, \langle v \rangle$$

Substituting $\langle v_{rel} \rangle$ for $\langle v \rangle$ in Eq. (16.8), we get the following 
expression for the mean number of collisions: 

$$\nu = \sqrt{2}\, \pi d^2 \langle v \rangle n \tag{16.9}$$
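
The factor $\sqrt{2}$ rests on the statement that the mean square of the relative velocity is twice the mean square velocity of a molecule. This can be checked by a simple numerical experiment, sketched below in Python. The sketch assumes that each velocity component is an independent Gaussian random quantity (as for a gas in equilibrium) and uses units in which the variance of a component equals unity; the function name, the sample size, and the seed are arbitrary choices of ours.

```python
import random


def mean_square_ratio(num_pairs=50_000, seed=1):
    """Estimate <v_rel^2> / <v^2> for pairs of independent random velocities."""
    rng = random.Random(seed)

    def velocity():
        # Three independent Gaussian components with zero mean.
        return tuple(rng.gauss(0.0, 1.0) for _ in range(3))

    sum_rel_sq = 0.0
    sum_sq = 0.0
    for _ in range(num_pairs):
        v1, v2 = velocity(), velocity()
        sum_rel_sq += sum((b - a) ** 2 for a, b in zip(v1, v2))
        # Average of v1^2 and v2^2, so that sum_sq / num_pairs estimates <v^2>.
        sum_sq += 0.5 * (sum(a * a for a in v1) + sum(b * b for b in v2))
    return sum_rel_sq / sum_sq


ratio = mean_square_ratio()
print("<v_rel^2> / <v^2> is approximately", round(ratio, 3))
print("root-mean-square ratio is approximately", round(ratio ** 0.5, 3))
```

Up to statistical scatter, the first number is close to 2 and the second to $\sqrt{2}$; the cross term averages out because the two velocities are independent and each has zero mean.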


Using this value of $\nu$ in Eq. (16.7), we get the following equation 
for the mean free path: 

$$l = \frac{1}{\sqrt{2}\, \pi d^2 n} \tag{16.10}$$


Substituting $\sigma$ for $\pi d^2$ in the above equation in accordance with 
Eq. (16.6), we get 

$$l = \frac{1}{\sqrt{2}\, \sigma n} \tag{16.11}$$


At constant temperature, n is proportional to p. Hence, the mean 
free path is inversely proportional to the pressure: 

$$l \propto \frac{1}{p} \tag{16.12}$$


We noted above that the collision diameter of molecules diminishes 
with increasing temperature. Accordingly, elevation of the temperature 
is attended by an increase in the free path. 

Let us assess the value of the mean free path and the average number 
of collisions a second. We established in Sec. 10.2 that molecules 
have dimensions of the order of several angstroms. Let us adopt 
the collision diameter of a molecule equal to 2 Å, i.e. $2 \times 10^{-10}$ m. 
A mole of a gas in standard conditions (i.e. at 0 °C and 
p = 1 atm) occupies a volume of $22.4 \times 10^{-3}$ m³. The number of 
molecules in unit volume in these conditions is 
$6 \times 10^{23} : 22.4 \times 10^{-3} \approx 3 \times 10^{25}$ m⁻³. Introducing these numbers 
into Eq. (16.10), we have 

$$l = \frac{1}{\sqrt{2} \times 3.14 \times 4 \times 10^{-20} \times 3 \times 10^{25}} \approx 2 \times 10^{-7}\ \text{m} = 2 \times 10^{-5}\ \text{cm}$$


At a pressure of $10^{-3}$ mmHg (which corresponds to about $10^{-6}$ atm), 
l will be of the order of 10 cm. If a vessel has dimensions of a few 
centimetres, then at such a pressure the molecules will travel from 
wall to wall virtually without colliding with one another. At a pressure 
of $10^{-6}$ mmHg, l reaches a value of scores of metres. 

When deriving Eq. (16.8), we assumed that l is much greater 
than d. Now we can see that this assumption is correct. Indeed, it 
follows from the above assessment that in standard conditions the 
ratio of l to d is about $2 \times 10^{-7} : 2 \times 10^{-10} = 10^{3}$. 

We can find the number of collisions a second by dividing the 
mean velocity of the molecules $\langle v \rangle$ by l. In Sec. 11.6, we obtained 
a value of $\langle v \rangle$ of about 500 m/s for oxygen. Dividing this value by 
$l = 2 \times 10^{-7}$ m, we get a value of $2.5 \times 10^{9}\ \text{s}^{-1}$ for the number of 
collisions a second. Thus, in standard conditions, the number of 
collisions a second is several thousand millions. The number of collisions 
diminishes with a decreasing pressure, changing in proportion 
to p [see Eq. (16.12)]. 
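
The estimates of this section are easily reproduced by a few lines of Python. The sketch uses the rounded values adopted above (d = 2 Å, $6 \times 10^{23}$ molecules per mole, a molar volume of 22.4 litres, a mean velocity of 500 m/s); the variable and function names are ours.

```python
import math

N_A = 6.0e23         # molecules per mole, rounded as in the text
V_MOLAR = 22.4e-3    # m^3, molar volume in standard conditions
D_COLLISION = 2e-10  # m, adopted collision diameter
V_MEAN = 500.0       # m/s, mean velocity quoted for oxygen


def mean_free_path(d, n):
    """Mean free path, Eq. (16.10): l = 1 / (sqrt(2) * pi * d^2 * n)."""
    return 1.0 / (math.sqrt(2.0) * math.pi * d ** 2 * n)


n = N_A / V_MOLAR                    # molecules per m^3
l = mean_free_path(D_COLLISION, n)   # m
nu = V_MEAN / l                      # collisions per second, from Eq. (16.7)

print(f"n    = {n:.2e} m^-3")
print(f"l    = {l:.2e} m")
print(f"l/d  = {l / D_COLLISION:.0f}")
print(f"nu   = {nu:.2e} 1/s")
```

The results agree in order of magnitude with the rounded figures given above: a mean free path of about $2 \times 10^{-7}$ m, a ratio l/d of about $10^{3}$, and a few thousand million collisions a second.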

16.3. Diffusion in Gases 

We shall attempt to get an equation of diffusion proceeding from 
molecular-kinetic notions. To simplify our task, we shall consider 
that the molecules of both components differ only slightly in mass 
($m_1 \approx m_2 \approx m$) and have virtually identical effective sections 
($\sigma_1 \approx \sigma_2 \approx \sigma$). In this case, we can assign the same mean 
velocity of thermal motion $\langle v \rangle$ to the molecules of both components 
and calculate the mean free path by the equation 

$$l = \frac{1}{\sqrt{2}\, \sigma n}$$

where $n = n_1 + n_2$. 

Fig. 16.6 

It is easy to understand that the process of diffusion in gases will 
proceed more intensively when the molecules have a higher velocity 
$\langle v \rangle$ and collide less frequently with one another (i.e. the free path 
l is greater). Consequently, we can expect that the diffusion coefficient 
D must be proportional to the product $\langle v \rangle l$. This agrees with 
the fact that, as we noted in Sec. 16.1, the dimension of D is L²/T. 

Let us begin our calculations. Assume that the change in the 
concentration of the first component along the z-axis is described 
by the function $n_1 = n_1(z)$. Let us denote the number of molecules 
of the first component flying through area S in the direction of the 
z-axis a second by $N_1'$, and the number of molecules flying in the 
direction opposite to the z-axis by $N_1''$ (Fig. 16.6)*. The difference 
between these numbers is the flux of the molecules of the first component 
$N_1$ through surface S: 

$$N_1 = N_1' - N_1'' \tag{16.13}$$

We shall proceed from the simplified notion according to which 
the molecules move along three mutually perpendicular directions 
coinciding with the axes x, y, z (the axes x and y are parallel to area 
S). In this case according to Eq. (11.24), the number of molecules 
flying in one of the directions through unit area a second is $\frac{1}{6} n \langle v \rangle$. 
Hence, the numbers $N_1'$ and $N_1''$ can be represented in the form 

$$N_1' = \frac{1}{6} n_1' \langle v \rangle S, \qquad N_1'' = \frac{1}{6} n_1'' \langle v \rangle S \tag{16.14}$$

where $n_1'$ = “effective” concentration of the molecules of the first 
component to the left of S 
$n_1''$ = the same to the right of S. 

* We have drawn Fig. 16.6 so that the molecules $N_1'$ fly through the upper 
half and the molecules $N_1''$ through the lower half of area S. Actually, both 
groups of molecules are distributed over the entire surface S. 

Molecules that experienced their last collision at different distances 
from surface S will fly through it. On an average, however, 
the last collision will occur at a distance from S equal to the mean 
free path l. It is therefore reasonable to take the value $n_1(z - l)$ 
as $n_1'$, and $n_1(z + l)$ as $n_1''$ (see Fig. 16.6). Hence, with a view to Eqs. 
(16.13) and (16.14), we can write that 

$$N_1 = \frac{1}{6} \langle v \rangle S \left[ n_1(z - l) - n_1(z + l) \right] \tag{16.15}$$

Since l is very small, the difference between the values of the function 
$n_1(z)$ given in Eq. (16.15) in brackets can be written in the 
form* 

$$n_1(z - l) - n_1(z + l) = -\frac{dn_1}{dz}\, 2l \tag{16.16}$$

Introducing this expression into Eq. (16.15), we find that 

$$N_1 = -\frac{1}{3} \langle v \rangle l\, \frac{dn_1}{dz}\, S \tag{16.17}$$

A comparison of Eqs. (16.17) and (16.1) shows that on the basis 
of molecular-kinetic notions we can not only arrive at a proper dependence 
of $N_1$ on $dn_1/dz$, but also obtain an expression for the diffusion 
coefficient D. This expression has the form 

$$D = \frac{1}{3} \langle v \rangle l \tag{16.18}$$

More strict calculations lead to the same formula, but with a somewhat 
different numerical factor. 
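
How accurate the replacement of the difference $n_1(z - l) - n_1(z + l)$ by the product of the gradient and 2l is can be judged from the following Python sketch. It assumes, purely for illustration, an exponential concentration profile $n_1(z) = n_0 e^{-z/L}$ with a characteristic length L; the profile and the function names are our assumptions and do not come from the text.

```python
import math


def exact_difference(n0, L, z, l):
    """n1(z - l) - n1(z + l) for the assumed profile n1(z) = n0 * exp(-z / L)."""
    return n0 * math.exp(-(z - l) / L) - n0 * math.exp(-(z + l) / L)


def linearized_difference(n0, L, z, l):
    """Right-hand side of Eq. (16.16): -(dn1/dz) * 2l, taken at the point z."""
    dn_dz = -(n0 / L) * math.exp(-z / L)
    return -dn_dz * 2.0 * l


def relative_error(l_over_L):
    """Relative error of the linearized expression for a given ratio l / L."""
    exact = exact_difference(1.0, 1.0, 0.0, l_over_L)
    approx = linearized_difference(1.0, 1.0, 0.0, l_over_L)
    return abs(exact - approx) / exact


for ratio in (1e-3, 1e-2, 1e-1, 1.0):
    print(f"l/L = {ratio:6.3f}   relative error = {relative_error(ratio):.2e}")
```

For this profile the error grows roughly as the square of l/L: it is negligible while the concentration changes little over a free path and becomes appreciable when l is comparable with L.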


* Equation (16.16) holds provided that the change in $n_1$ over the free path 
is much smaller than $n_1$ itself: $\left| \frac{dn_1}{dz} \right| l \ll n_1$. 
This condition gives a criterion for 
the smallness of the deviation from equilibrium (see the fourth paragraph of 
Sec. 16.1). 

This remark relates to similar formulas of the following two sections. For 
example, Eq. (16.23) holds provided that $\left| \frac{dT}{dz} \right| l \ll T$. 

It must be noted that, as we assumed, the diffusion coefficient is 
proportional to the product $\langle v \rangle l$. 

The derivation that led us to Eq. (16.17) can be applied with equal 
rights to both components of a mixture. Hence, the diffusion coeffi- 
cient has the same value for both components. 

Let us investigate the expression we have obtained for the diffusion 
coefficient D. Inserting in Eq. (16.18) the expressions for $\langle v \rangle$ 
and $l$, we find that 

$$D \propto \frac{1}{n\sigma} \sqrt{\frac{T}{m}} \tag{16.19}$$

It follows from Eq. (16.19) that the diffusion coefficient is inversely 
proportional to the number of molecules in a unit volume, and, consequently, 
at constant temperature, to the pressure p: 

$$D \propto \frac{1}{p}$$

With elevation of the temperature at constant n, D grows approximately 
in proportion to $\sqrt{T}$ (we remind our reader that $\sigma$ depends slightly on 
T). 
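
The dependences just described can be checked with a short Python sketch that combines Eq. (16.18) with the mean velocity $\langle v \rangle = \sqrt{8kT/\pi m}$ of the Maxwell distribution and with Eq. (16.10). The collision diameter is treated here as a constant, which is an approximation; the function names and the numerical values for oxygen are assumptions made for illustration.

```python
import math

K_B = 1.380649e-23   # Boltzmann constant, J/K


def mean_speed(T, m):
    """Mean velocity of the Maxwell distribution, sqrt(8kT / (pi m))."""
    return math.sqrt(8.0 * K_B * T / (math.pi * m))


def mean_free_path(n, d):
    """Eq. (16.10): l = 1 / (sqrt(2) * pi * d^2 * n)."""
    return 1.0 / (math.sqrt(2.0) * math.pi * d ** 2 * n)


def self_diffusion(T, n, d, m):
    """Eq. (16.18): D = <v> * l / 3, with d assumed independent of T."""
    return mean_speed(T, m) * mean_free_path(n, d) / 3.0


# Assumed illustrative values: oxygen at 0 deg C and 1 atm, d = 2 angstroms.
T0 = 273.15
m_o2 = 32 * 1.66054e-27          # kg
n0 = 101325.0 / (K_B * T0)       # m^-3, from p = nkT
d = 2e-10                        # m

D0 = self_diffusion(T0, n0, d, m_o2)
print(f"D           = {D0:.2e} m^2/s")
print(f"D(n/2) / D  = {self_diffusion(T0, n0 / 2, d, m_o2) / D0:.3f}")
print(f"D(4T) / D   = {self_diffusion(4 * T0, n0, d, m_o2) / D0:.3f}  (n kept constant)")
```

Halving n doubles D, and a fourfold increase in T at constant n also doubles it, in accordance with Eq. (16.19). If the pressure rather than n were kept constant, n would itself fall as 1/T and D would grow faster than $\sqrt{T}$.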

We have assumed that the molecules of both components are iden- 
tical in mass and effective section. Therefore, Eq. (16.18) is in es- 
sence an expression for the coefficient of self-diffusion, i.e. the diffu- 
sion of the molecules of a gas in a medium containing molecules of 
the same gas. The phenomenon of self-diffusion could be observed 
if we marked in some way or other part of the molecules of a homo- 
geneous gas. If the concentrations of marked molecules and of the 
molecules bearing no marks were not constant, counterflows of the 
two kinds of molecules would appear in the gas, and the magnitude 
of the flows would be determined by Eq. (16.17). Self-diffusion can 
be studied in practice by employing the tracer technique. It consists 
in using a mixture of isotopes, i.e. varieties of atoms of the same ele- 
ment differing from each other, for example, in that one variety of 
the atoms is radioactive and the other is stable. 

When the molecules of the two components of a mixture differ 
in their mass and in effective section, the diffusion coefficient is 
determined by the expression 

$$D = \frac{n_1 \langle v_2 \rangle l_2 + n_2 \langle v_1 \rangle l_1}{3 (n_1 + n_2)}$$

where $n_1$, $\langle v_1 \rangle$, $l_1$ = concentration, mean velocity, and mean free 
path of the molecules of the first component 
$n_2$, $\langle v_2 \rangle$, $l_2$ = the same quantities for the molecules of the 
second component. 
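
A minimal Python sketch of this expression is given below; it also verifies that for identical molecules the expression reduces to Eq. (16.18). The function name and the numerical values are illustrative assumptions.

```python
def mixture_diffusion(n1, v1, l1, n2, v2, l2):
    """D = (n1 * <v2> * l2 + n2 * <v1> * l1) / (3 * (n1 + n2))."""
    return (n1 * v2 * l2 + n2 * v1 * l1) / (3.0 * (n1 + n2))


# Identical molecules: the result must coincide with <v> * l / 3.
v, l = 500.0, 2e-7   # m/s and m, the rounded values used in Sec. 16.2
D_same = mixture_diffusion(1e25, v, l, 2e25, v, l)
print("identical molecules:", D_same, "vs", v * l / 3.0)

# Different molecules (assumed values): D lies between the two products <v> l / 3.
D_mix = mixture_diffusion(1e25, 400.0, 1.5e-7, 2e25, 1200.0, 3e-7)
print("different molecules:", D_mix)
```

The expression is a weighted mean of the products $\langle v_1 \rangle l_1 / 3$ and $\langle v_2 \rangle l_2 / 3$, each weighted by the concentration of the other component.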




16.4. Thermal Conductivity of Gases 


Let us calculate the heat flux in a gas on the basis of molecular-kinetic 
notions. If the temperature of the gas differs at different 
points, then the mean energy of the molecules at these points will 
also differ. Being displaced owing to thermal motion from one set 
of points to another, the molecules transport the energy they have 
stored. This energy transfer underlies the process of thermal conduc- 
tivity in gases. 

Before beginning our calculations, let us attempt to reveal the 
factors that can affect the ability of a gas to conduct heat. It is easy 
to understand that apart from the factors determining the rate 
of diffusion, i.e. the mean velocity of the molecules $\langle v \rangle$ and 
the free path l, the amount of energy transported by the molecules 
must depend on the ability of the molecules to store energy, 
i.e. on the heat capacity of the gas. 

Let us consider a gas in which 
a temperature gradient is main- 
tained in some way or other 
along the direction which we 
have denoted by the letter z. Let us mentally imagine area S 
perpendicular to this direction (Fig. 16.7). 

On the basis of simplified notions, we shall consider that the number 
of molecules flying in one second through area S in each direction 
(from left to right and from right to left) is 

$$N = \frac{1}{6} n \langle v \rangle S \tag{16.20}$$


At constant pressure, n depends on the temperature (p = nkT ), 
and ( v ) also changes with the temperature. Accordingly, it would 
seem that to find the number of molecules flying through area S 
from left to right we ought to use in Eq. (16.20) the values of n and 
(v) corresponding to one temperature, and to find the number of 
molecules flying from right to left, the values of n and (v) correspond- 
ing to another temperature. The numbers of molecules flying 
through area S in opposite directions cannot differ, however. If 
they did, then apart from the heat flux through area S , we would 
also observe a flow of matter — the gas would be transported from 
one part of space to another. But we assumed that no processes occur 
in the gas except for the transport of heat. Therefore, we shall use 
Eq. (16.20) to calculate the number of molecules flying through 
S in each direction, assuming for n and $\langle v \rangle$ their values in 
section S. 

It must be noted that since n = p/kT, i.e. n is proportional to p/T, 
and $\langle v \rangle$ is proportional to $\sqrt{T}$, the constancy of the product $n \langle v \rangle$ 
signifies the constancy of the expression 

$$\frac{p}{T} \sqrt{T} = \frac{p}{\sqrt{T}}$$

Hence, for no flow of molecules to be observed when a temperature 
gradient is present, it is essential that the pressure change along the 
z-axis in proportion to $\sqrt{T}$. 

In calculating the heat flux, we shall proceed from the assumption 
that every molecule carries with it the energy $\varepsilon = \frac{i}{2} kT$ corresponding 
to the temperature at the spot where its last collision with 
another molecule occurred. On the average, this collision occurs at 
a distance from S equal to the mean free path l (see Fig. 16.7). Thus, 
the energy $\langle \varepsilon_1 \rangle$ corresponding to the temperature $T_1 = T(z - l)$, 
i.e. to the temperature in the plane $(z - l)$, should be ascribed 
to the molecules flying in the direction of the z-axis, and the 
energy $\langle \varepsilon_2 \rangle$ corresponding to the temperature $T_2 = T(z + l)$ should 
be ascribed to the molecules flying in the opposite direction (here 
z is the coordinate of plane S). 

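In accordance with the above, we get the following expression for 
the heat flux through area S in the positive direction of the z-axis: 

$$q = N \left( \langle \varepsilon_1 \rangle - \langle \varepsilon_2 \rangle \right)$$

where N is determined by Eq. (16.20). Introduction of the values 
of N, $\langle \varepsilon_1 \rangle$, and $\langle \varepsilon_2 \rangle$ yields 

$$q = \frac{1}{6} n \langle v \rangle S \left( \frac{i}{2} k T_1 - \frac{i}{2} k T_2 \right) = \frac{1}{6} n \langle v \rangle S\, \frac{i}{2} k\, (T_1 - T_2) \tag{16.21}$$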

The difference $T_1 - T_2$ equals 

$$T(z - l) - T(z + l) = -\frac{dT}{dz}\, 2l \tag{16.22}$$

(we have taken into account the smallness of l). Here dT/dz is the 
derivative of T with respect to z at the location of plane S. 

With a view to Eq. (16.22), we can write Eq. (16.21) as follows: 

$$q = -\frac{1}{6} n \langle v \rangle S\, \frac{i}{2} k\, \frac{dT}{dz}\, 2l = -\frac{1}{3} \langle v \rangle l \left( \frac{i}{2} k n \right) \frac{dT}{dz}\, S \tag{16.23}$$

A comparison of this equation with Eq. (16.3) gives the following 
expression for the thermal conductivity coefficient: 

$$\kappa = \frac{1}{3} \langle v \rangle l \left( \frac{i}{2} k n \right) \tag{16.24}$$

It should be remembered that the expression $\frac{i}{2} R = \frac{i}{2} k N_A$ 
determines the heat capacity at constant volume $C_V$ of one mole of 
a gas, i.e. the amount of a gas containing $N_A$ molecules. Similarly, 
the expression $\frac{i}{2} k n$ is the heat capacity of the amount of a gas 
containing n molecules, i.e. the heat capacity of a unit volume of 
the gas. We can obtain this heat capacity by multiplying the specific 
heat capacity $c_V$ (the heat capacity of a unit mass) by the mass 
of a unit volume, i.e. by the density of the gas $\rho$. Thus, 

$$\frac{i}{2} k n = \rho c_V \tag{16.25}$$

Introducing Eq. (16.25) into (16.24), we arrive at the final expression 
for the thermal conductivity coefficient of a gas: 

$$\kappa = \frac{1}{3} \langle v \rangle l \rho c_V \tag{16.26}$$

As we expected, the thermal conductivity coefficient was found 
to be proportional to $\langle v \rangle$, $l$, and the heat capacity of a unit volume 
of the gas $\rho c_V$. More strict calculations lead to a similar expression 
for $\kappa$, but with a somewhat different numerical factor. 

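Let us find how $\kappa$ depends on the quantities characterizing molecules 
and on the parameters of state of a gas. Taking into account 
that $\langle v \rangle$ is proportional to $\sqrt{T/m}$, $l$ is proportional to $1/(n d^2)$, and 
$\rho c_V$ is proportional to $in$ [see Eq. (16.25)], we get 

$$\kappa \propto \sqrt{\frac{T}{m}} \cdot \frac{1}{n d^2} \cdot i n = \frac{i}{d^2} \sqrt{\frac{T}{m}} \tag{16.27}$$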

Inspection of expression (16.27) shows that unlike the diffusion 
coefficient, the thermal conductivity coefficient of a gas does not 
depend on the number of molecules in a unit volume, and, there- 
fore, on the pressure (p = nkT). This is due to the following reasons. 
Reduction of the pressure is attended by diminishing of n , i.e. of 
the number of molecules participating in the transfer of energy. 
Simultaneously, $l$ grows and, consequently, also the difference 
between the energies transported by every molecule in opposite 
directions. As a result, we see that the amount of energy transferred 
by the molecules at the given temperature gradient does not 
depend on the pressure. This holds only as long as $l$ remains small 
in comparison with the distance between the surfaces exchanging 
heat at the expense of the thermal conductivity of the gas contained 
between them (for example, in comparison with the size of the gap 
between the internal and external walls of a glass vacuum bottle). 
As this condition stops being observed, the thermal conductivity 
begins to depend to a greater and greater extent on the pressure, 
diminishing when the latter is lowered. When $l$ exceeds the distance 
between the surfaces, the path of the molecules is determined by the 
magnitude of this distance and stops depending on the pressure. The 
number of molecules per unit volume continues to fall off with decreasing 
pressure, the result being a reduction in $\kappa$. 

When the temperature is raised, the thermal conductivity coeffi- 
cient grows somewhat more rapidly than V T. The cause is that the 
effective section a depends slightly on T (see Sec. 16.2). 
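
The pressure independence of the thermal conductivity can be checked numerically. The sketch below evaluates Eq. (16.26) written through Eq. (16.25). It assumes the standard kinetic-theory expressions $\langle v \rangle = \sqrt{8kT/\pi m}$ and $l = 1/(\sqrt{2}\,\sigma n)$, and uses illustrative values for nitrogen (molecular diameter of about 0.37 nm, $i = 5$); the function names and these numbers are assumptions made for the example, not data from the text.

```python
import math

K_B = 1.380649e-23        # Boltzmann constant, J/K
AMU = 1.66053907e-27      # atomic mass unit, kg

# Illustrative values for nitrogen (assumptions, not taken from the text)
M_N2 = 28.0 * AMU                      # mass of one molecule, kg
SIGMA_N2 = math.pi * (3.7e-10) ** 2    # effective section, m^2
I_N2 = 5                               # degrees of freedom, rigid diatomic molecule


def mean_speed(T, m):
    """Mean thermal speed <v> = sqrt(8kT / (pi m))."""
    return math.sqrt(8.0 * K_B * T / (math.pi * m))


def mean_free_path(n, sigma):
    """Mean free path l = 1 / (sqrt(2) sigma n)."""
    return 1.0 / (math.sqrt(2.0) * sigma * n)


def thermal_conductivity(p, T, m=M_N2, sigma=SIGMA_N2, i=I_N2):
    """kappa = (1/3) <v> l (i/2) k n, with n = p / (kT)."""
    n = p / (K_B * T)
    heat_capacity_per_volume = 0.5 * i * K_B * n   # equals rho * c_V
    return mean_speed(T, m) * mean_free_path(n, sigma) * heat_capacity_per_volume / 3.0


if __name__ == "__main__":
    for p in (1.0e5, 1.0e4, 1.0e3):
        kappa = thermal_conductivity(p, 300.0)
        print(f"p = {p:8.1e} Pa   kappa = {kappa:.4e} W/(m K)")
```

The printed values coincide for all three pressures because $n$ cancels in the product $l \cdot n$. With a fixed $\sigma$ the sketch gives exactly $\varkappa \propto \sqrt{T}$; the slightly faster growth mentioned above is not modelled.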


16.5. Viscosity of Gases 


To understand the origin of forces of internal friction, let us consider
two contacting layers of a gas having a thickness of $\Delta z$. We shall
assume that the layers move with different velocities $u_1$ and $u_2$
(Fig. 16.8). Every molecule of the gas participates in two motions:
chaotic thermal motion whose mean velocity is $\langle v \rangle$, and ordered
motion with the velocity $u$ that is much smaller than $\langle v \rangle$.

Fig. 16.8

Suppose that at a certain moment the layers have the momenta
$K_1$ and $K_2$. These momenta cannot remain unchanged because, owing
to thermal motion, molecules continuously pass from one layer to
the other. According to our simplified notions, the number of
molecules passing through area $S$ from one layer to the other per
second is determined by the expression

$$N = \frac{1}{6} n \langle v \rangle S \qquad (16.28)$$


(the insignificant effect of ordered motion on the magnitude of the 
velocity of the molecules may be disregarded). 

Upon getting into another layer, a molecule collides with the 
molecules of that layer. As a result, it either gives up its surplus 
momentum to other molecules (if it arrived from a layer moving 
with a greater velocity), or increases its momentum at the expense 
of the other molecules (if it arrived from a layer moving with a small- 
er velocity). As a result, the momentum of the faster layer dimin- 
ishes, and of the slower one grows. The layers thus behave as if a 
retarding force is applied to the first layer (whose velocity is greater), 
and an accelerating force equal in magnitude is applied to the 
second layer (whose velocity is lower). 

The momentum transferred through area $S$ on the interface between
the layers depicted in Fig. 16.8 in unit time from the first
layer to the second one is

$$K = N (m u_1 - m u_2)$$

($m$ is the mass of a molecule). Introduction of Eq. (16.28) for $N$
yields

$$K = \frac{1}{6} n \langle v \rangle S m (u_1 - u_2) \qquad (16.29)$$

In a real gas flow, the velocity when crossing the interface between
two layers changes not in a jump, but continuously according to the
law $u = u(z)$ (Fig. 16.9). We shall consider that every molecule
flying through surface $S$ carries along with it the momentum $mu$
determined by the value of the velocity $u$ at the spot where the
last collision of the molecule occurred. Different molecules
experience their last collision at the most diverse distances from $S$.
On the average, this collision occurs at a distance equal to the free
path $l$. We shall therefore ascribe the velocity $u_1 = u(z - l)$ to the
molecules flying in the direction of the $z$-axis, and the velocity
$u_2 = u(z + l)$ to the molecules flying in the opposite direction.
Inserting these values in Eq. (16.29), we get the following expression
for the momentum flux in the direction of the $z$-axis:

$$K = \frac{1}{6} n \langle v \rangle S m \, [u(z - l) - u(z + l)] = -\frac{1}{6} n \langle v \rangle S m \, \frac{du}{dz} \, 2l$$

[compare with Eq. (16.23)].

Taking into account that the product $nm$ equals the density of
a gas $\rho$, we can write that

$$K = -\left( \frac{1}{3} \langle v \rangle l \rho \right) \frac{du}{dz} S$$

A comparison with Eq. (16.5) gives an expression for the viscosity:

$$\eta = \frac{1}{3} \langle v \rangle l \rho \qquad (16.30)$$

Stricter calculations lead to the same expression, but with a some- 
what different numerical factor. 

It can be seen from Eq. (16.30) that, like $D$ and $\varkappa$, the viscosity
is proportional to $\langle v \rangle$ and $l$. It is also proportional to the density
of a gas $\rho$, i.e. to a quantity characterizing the ability of a gas to
“accumulate” momentum: at a given velocity $u$, the higher the
density $\rho$, the greater is the momentum of a unit volume of the gas
(we remind our reader that the thermal conductivity is proportional
to the heat capacity of a unit volume of a gas).

Taking into account the expressions for the quantities in
Eq. (16.30), we can write that

$$\eta \propto \frac{\sqrt{mT}}{\sigma}$$

Hence, it follows that, like $\varkappa$, the viscosity does not depend on the
pressure. This holds only as long as $l$ remains small in comparison
with the size of the gap through which the gas is flowing (for
example, in comparison with the diameter of a pipe). When this
condition ceases to hold, the viscosity begins to depend more and more
on the pressure, diminishing when the latter drops. The viscosity
$\eta$ depends on the temperature in the same way as $D$ and $\varkappa$.
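
As a rough numerical illustration of Eq. (16.30) and of the proportionality $\eta \propto \sqrt{mT}/\sigma$, the following sketch evaluates the viscosity for arbitrary $p$, $T$, $m$ and $\sigma$. The prefactors in $\langle v \rangle = \sqrt{8kT/\pi m}$ and $l = 1/(\sqrt{2}\,\sigma n)$ are the standard kinetic-theory ones and are assumed here; the helper names are hypothetical.

```python
import math

K_B = 1.380649e-23        # Boltzmann constant, J/K
AMU = 1.66053907e-27      # atomic mass unit, kg


def viscosity(p, T, m, sigma):
    """eta = (1/3) <v> l rho for an ideal gas at pressure p and temperature T."""
    n = p / (K_B * T)                                   # molecules per unit volume
    v_mean = math.sqrt(8.0 * K_B * T / (math.pi * m))   # mean thermal speed
    free_path = 1.0 / (math.sqrt(2.0) * sigma * n)      # mean free path
    rho = n * m                                         # density
    return v_mean * free_path * rho / 3.0


def scaling_form(T, m, sigma):
    """The combination sqrt(m T) / sigma to which eta is proportional."""
    return math.sqrt(m * T) / sigma


if __name__ == "__main__":
    m = 28.0 * AMU                       # nitrogen molecule (illustrative)
    sigma = math.pi * (3.7e-10) ** 2     # assumed effective section, m^2
    for p in (1.0e5, 1.0e3):
        print(f"p = {p:.0e} Pa   eta = {viscosity(p, 300.0, m, sigma):.3e} Pa s")
```

The ratio `viscosity(...) / scaling_form(...)` is the same constant for any choice of the arguments, which is the content of the proportionality above.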


16.6. Ultrararefied Gases 

When the free path of molecules exceeds the linear dimensions 
of the vessel confining them, we say that a vacuum has been achieved 
in the vessel. The gas in this case is called ultrararefied. Al- 
though the word vacuum literally means “emptiness”, an ultrarare- 
fied gas contains a great number of molecules in a unit volume. Thus, 
at a pressure of $10^{-6}$ mmHg, one cubic metre contains about $10^{16}$
molecules. Moreover, in very minute pores, the state defined as a 
vacuum can also be achieved at atmospheric pressure. 
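
The order of magnitude quoted above follows from $p = nkT$. The sketch below computes $n$ at $10^{-6}$ mmHg and compares the mean free path with typical sizes. Room temperature (300 K) and an effective section corresponding to a molecular diameter of about 0.37 nm are assumptions chosen only for illustration.

```python
import math

K_B = 1.380649e-23      # Boltzmann constant, J/K
PA_PER_MMHG = 133.322   # pascals in one millimetre of mercury

T_ROOM = 300.0                        # assumed temperature, K
SIGMA = math.pi * (3.7e-10) ** 2      # assumed effective section, m^2


def number_density(p_pa, T=T_ROOM):
    """Molecules per cubic metre from p = n k T."""
    return p_pa / (K_B * T)


def mean_free_path(p_pa, T=T_ROOM, sigma=SIGMA):
    """Mean free path l = 1 / (sqrt(2) sigma n)."""
    return 1.0 / (math.sqrt(2.0) * sigma * number_density(p_pa, T))


if __name__ == "__main__":
    p_vac = 1.0e-6 * PA_PER_MMHG
    print(f"n at 1e-6 mmHg: {number_density(p_vac):.2e} per m^3")
    print(f"l at 1e-6 mmHg: {mean_free_path(p_vac):.1f} m")
    print(f"l at 760 mmHg:  {mean_free_path(760.0 * PA_PER_MMHG):.2e} m")
```

Under these assumptions the free path at $10^{-6}$ mmHg comes out at tens of metres, far larger than an ordinary vessel, whereas at atmospheric pressure it is a small fraction of a micrometre, so only pores smaller than that are in the "vacuum" state.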

The behaviour of ultrararefied gases is distinguished by numerous 
features. For conditions of a vacuum, we cannot speak of the pres- 
sure of one part of a gas on another. In ordinary conditions, the mole- 
cules often collide with one another. Consequently, any surface 
with which we mentally divide a gas into two parts will experience 
an exchange of momenta between molecules. Thus, one part of the 
gas will act on the other with the pressure p over the interface. In 
a vacuum, the molecules exchange momenta only with the walls of 
the vessel, so that only the concept of the pressure of a gas on a wall 
has a meaning. Internal friction is also absent in the gas. But a 
body moving in an ultrararefied gas will experience the action of a 
friction force due to the fact that the molecules colliding with this 
body will change its momentum. Let us consider this in greater de- 
tail. 

Assume that two plates are moving parallel to each other in an
ultrararefied gas (Fig. 16.10). The velocities of the plates are $u_1$
and $u_2$. The interaction between a molecule and a plate at the
moment of a collision results in the molecule, upon rebounding from
the plate, having a velocity component equal in magnitude and
direction to the velocity of the plate in addition to its thermal
velocity.

A unit area of the upper plate will be struck in one second by
$\frac{1}{6} n \langle v \rangle$ molecules having a velocity component $u_2$ acquired in the
preceding collision with the lower plate. Each of these molecules
carries a momentum component of $mu_2$. Upon rebounding from the
upper plate, the molecules have a momentum component of $mu_1$.



Fig. 16.10


Fig. 16.11


Consequently, a collision of every molecule with the upper plate
results in the momentum of the plate diminishing by $m (u_1 - u_2)$.
The change in the momentum in unit time related to unit surface area
of the plate is

$$\frac{1}{6} n \langle v \rangle m (u_1 - u_2)$$

This change equals the force acting on a unit surface area of the
plate:

$$F = \frac{1}{6} \rho \langle v \rangle (u_1 - u_2) \qquad (16.31)$$

(we have substituted $\rho$ for $nm$). A force of the same magnitude, but
opposite in direction, acts on a unit surface area of the lower plate.

It is natural to call the proportionality constant between the force
of friction and the velocity difference of the plates the coefficient
of friction. It can be seen from Eq. (16.31) that this coefficient
equals $\frac{1}{6} \rho \langle v \rangle$, i.e. is proportional to the density of the gas and,
consequently, to the pressure of the gas on a plate and the walls of
the vessel (the expression $p = nkT$ remains in force for this
pressure).
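
The proportionality of the friction force to the pressure can be illustrated with a short sketch of Eq. (16.31). The density is taken from $\rho = nm = pm/kT$ and the mean speed from the standard expression $\langle v \rangle = \sqrt{8kT/\pi m}$; the function name and the sample numbers are assumptions made for the example.

```python
import math

K_B = 1.380649e-23        # Boltzmann constant, J/K
AMU = 1.66053907e-27      # atomic mass unit, kg


def friction_per_unit_area(p, T, m, u1, u2):
    """F = (1/6) rho <v> (u1 - u2) for two plates in an ultrararefied gas."""
    rho = p * m / (K_B * T)                              # rho = n m, with n = p / (kT)
    v_mean = math.sqrt(8.0 * K_B * T / (math.pi * m))    # mean thermal speed
    return rho * v_mean * (u1 - u2) / 6.0


if __name__ == "__main__":
    m = 28.0 * AMU   # nitrogen molecule (illustrative)
    for p in (1.0e-2, 2.0e-2, 4.0e-2):
        f = friction_per_unit_area(p, 300.0, m, u1=1.0, u2=0.0)
        print(f"p = {p:.1e} Pa   F = {f:.3e} N/m^2")
```

Doubling the pressure doubles the force, in contrast to the pressure-independent viscosity of a gas in ordinary conditions.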

Let us now consider the transfer of heat by a gas in a vacuum. 
We shall consider two plates with temperatures $T_1$ and $T_2$ between
which there is an ultrararefied gas (Fig. 16.11). If the impact of the 
molecules against the surface of the rigid body were of a perfectly 
elastic nature, the molecules would rebound from a plate with the 
same velocity in magnitude (and, consequently, with the same energy)
as they had before the collision. As a result, the molecules would
not be able to transfer energy from one plate to the other. Such a 
conclusion, however, contradicts experimental data. Hence, the 
interaction between a wall and a molecule striking it is not an elas- 
tic collision in nature. Indeed, it occurs as follows: upon striking 
a wall, a molecule adheres to it, as it were, for a brief time, after 
which it leaves the wall in an absolutely arbitrary direction with 
a velocity whose magnitude on the average corresponds to the tem- 
perature of the wall*. 

Let us revert to Fig. 16.11. Each of the $\frac{1}{6} n \langle v \rangle S$ molecules
striking the upper plate per second brings the energy $\frac{i}{2} k T_2$ with
it and carries away an energy of $\frac{i}{2} k T_1$. Hence, each impact of a
molecule against a plate leads to the latter losing the energy
$\frac{i}{2} k (T_1 - T_2)$. The second plate receives the same energy upon each
impact. Thus, the amount of energy transferred by the molecules
in one second from plate to plate will be

$$q = \frac{1}{6} n \langle v \rangle \frac{i}{2} k (T_1 - T_2) S$$

Multiplying and dividing this expression by $m N_A$, we get

$$q = \frac{1}{6} \rho \langle v \rangle c_V (T_1 - T_2) S \qquad (16.32)$$

The thermal conductivity coefficient equal to $\frac{1}{6} \rho \langle v \rangle c_V$ is found
to be proportional to the density of a gas when the latter is
ultrararefied. Hence, heat transfer from one wall to another will fall
off with decreasing pressure, whereas the thermal conductivity of a
gas in ordinary conditions does not depend, as we have seen, on
the pressure.
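
The contrast between the two regimes can be sketched numerically. The code below compares the heat flux per unit area between two plates in the ultrararefied regime, Eq. (16.32), with the flux in ordinary conditions obtained from Eq. (16.24). Several simplifications are assumed and are not part of the text: the gas properties are evaluated at the mean temperature of the plates, the temperature gradient in ordinary conditions is taken as $(T_1 - T_2)/d$ for a gap of width $d$, and the standard expressions for $\langle v \rangle$ and $l$ are used.

```python
import math

K_B = 1.380649e-23        # Boltzmann constant, J/K
AMU = 1.66053907e-27      # atomic mass unit, kg


def mean_speed(T, m):
    return math.sqrt(8.0 * K_B * T / (math.pi * m))


def flux_rarefied(p, T1, T2, m, i):
    """q/S = (1/6) n <v> (i/2) k (T1 - T2); valid when l exceeds the gap."""
    T_mean = 0.5 * (T1 + T2)            # assumption: properties at mean temperature
    n = p / (K_B * T_mean)
    return n * mean_speed(T_mean, m) * 0.5 * i * K_B * (T1 - T2) / 6.0


def flux_ordinary(p, T1, T2, gap, m, sigma, i):
    """q/S = kappa (T1 - T2) / gap with kappa = (1/3) <v> l (i/2) k n; valid when l << gap."""
    T_mean = 0.5 * (T1 + T2)
    n = p / (K_B * T_mean)
    free_path = 1.0 / (math.sqrt(2.0) * sigma * n)
    kappa = mean_speed(T_mean, m) * free_path * (0.5 * i * K_B * n) / 3.0
    return kappa * (T1 - T2) / gap


if __name__ == "__main__":
    m, sigma, i = 28.0 * AMU, math.pi * (3.7e-10) ** 2, 5   # illustrative nitrogen values
    for p in (1.0e-2, 1.0e-3):
        print(f"rarefied, p = {p:.0e} Pa: q/S = {flux_rarefied(p, 350.0, 300.0, m, i):.3e} W/m^2")
    for p in (1.0e5, 1.0e4):
        q = flux_ordinary(p, 350.0, 300.0, 5.0e-3, m, sigma, i)
        print(f"ordinary, p = {p:.0e} Pa: q/S = {q:.3e} W/m^2")
```

The rarefied flux scales linearly with the pressure, while the ordinary flux does not change, which is why evacuating the gap of a vacuum bottle reduces heat transfer only once the free path has become comparable with the gap.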


* We must note that this more precise definition of the nature of interaction 
of the molecules with a wall does not affect the results which we obtained in 
Sec. 11.4 when calculating the pressure. If the temperature of the gas and the 
walls is the same, then the molecules will leave a wall with the same mean 
velocity with which they collided with it, so that the change in the momentum 
of the molecules as a result of a collision is the same on an average as in a per-
fectly elastic collision.


A.1. List of Symbols

A amplitude; atomic mass; work 
a distance 

a acceleration*; vector 
B amplitude 
b distance; thickness 
b vector 

C curvature; molar heat capacity 

c distance; relative concentration; specific heat capacity; speed 
of light 
c vector 

D diffusion coefficient 

d collision diameter; diameter 

d vector 

E energy; mechanical equivalent of heat; Young’s modulus 
e base of natural logarithms 

e unit vector 

F free (Helmholtz) energy 
F force 

F* non-conservative force 

f coefficient of friction; relative fluctuation
f force 

G Gibbs thermodynamic potential (Gibbs energy); gravitational 
constant; shear modulus 
g acceleration of free fall 

g' gravitational field vector, gravitational intensity 
H curvature; enthalpy 

h height; Planck’s constant 
ħ Planck’s constant divided by 2π (h/2π)

I moment of inertia

Im imaginary part

i imaginary unit (i = √−1)

K momentum (also p); momentum flux 

k Boltzmann constant; constant of proportionality; quasi-elastic 
force coefficient; spring constant 
L heat of transition 


* The magnitude of a vector is denoted by the same symbol as the vector
itself, but in ordinary italic (sloping) type.
L angular momentum
L dimension of length 

l length; mean free path of molecule 

M mass 

M moment of force (torque)

M dimension of mass 
m mass 

N number of molecules, particles, etc. 

N_A Avogadro constant

n number of molecules or particles per unit volume; polytropic 
exponent 

P power; probability 
P force of gravity 
p pressure 

p momentum (also K) 

Q amount of heat; quality; rate of flow 
q heat flux 

R molar gas constant; radius of curvature 
Re Reynolds number 
Re real 

r radius; resistance coefficient 
r displacement; position (radius) vector 
S area; entropy 
s distance; “interval” 

T absolute temperature; period of revolution 
T dimension of time 
t temperature, Celsius scale; time 
U internal energy 
u velocity 

V potential function; volume 
v velocity 
W weight 
w energy density 
x coordinate 
y coordinate 

z complex number; coordinate 

α angle; coefficient of elastic compliance; constant; initial phase
of oscillations
α angular acceleration

β angle; damping factor; refrigerating factor (coefficient of per-
formance)

γ angle; ratio of heat capacities C_p/C_V; relative shear
Δ increment
Δ' elementary amount
ε energy; strain
η efficiency; viscosity (dynamic)

Θ thermodynamic temperature
θ angle; temperature
ϰ thermal conductivity coefficient

Λ volume of cell in v-space

λ logarithmic decrement

μ reduced mass

ν frequency; kinematic viscosity; number of revolutions per unit
time

π ratio of circumference to diameter, π = 3.14...
ρ density; polar coordinate

σ effective section of molecule; stress; surface tension

τ proper time; tangential or shear stress

τ unit vector of tangent to trajectory

φ angle; polar coordinate

Ω solid angle; statistical weight

ω cyclic frequency

ω angular velocity


A.2. Calculation of Selected Integrals


1. The improper integral

$$I(\beta) = \int_{-\infty}^{+\infty} \exp(-\beta x^2) \, dx \qquad (A.1)$$

is called the Poisson integral. Denoting the integration variable
by the letter $y$, we can write this integral in the form

$$I(\beta) = \int_{-\infty}^{+\infty} \exp(-\beta y^2) \, dy$$

Multiplying the two expressions, we arrive at the double integral

$$[I(\beta)]^2 = \int_{-\infty}^{+\infty} \exp(-\beta x^2) \, dx \int_{-\infty}^{+\infty} \exp(-\beta y^2) \, dy = \int_{-\infty}^{+\infty} \int_{-\infty}^{+\infty} \exp[-\beta (x^2 + y^2)] \, dx \, dy \qquad (A.2)$$

It is easy to calculate this integral by considering the variables
$x$ and $y$ as Cartesian coordinates in a plane and passing over from
these coordinates to the polar ones $r$ and $\varphi$. When $x$ and $y$ vary
from $-\infty$ to $+\infty$, the coordinate $r$ varies within the limits from 0
to $\infty$, and $\varphi$ within the limits from 0 to $2\pi$. The sum $x^2 + y^2$ equals
$r^2$, while the surface element $dx \, dy$ in polar coordinates has the
form $r \, dr \, d\varphi$. Performing this substitution in Eq. (A.2), we arrive
at the expression

$$[I(\beta)]^2 = \int_0^{2\pi} d\varphi \int_0^{\infty} \exp(-\beta r^2) \, r \, dr = 2\pi \cdot \frac{1}{2\beta} = \frac{\pi}{\beta}$$

Hence for the integral (A.1), we get the value $I(\beta) = \sqrt{\pi/\beta}$. Thus,

$$\int_{-\infty}^{+\infty} \exp(-\beta x^2) \, dx = \sqrt{\frac{\pi}{\beta}} \qquad (A.3)$$

2. Both sides of Eq. (A.3) can be considered as a function of the
parameter $\beta$. Differentiating with respect to this parameter (at the
left we differentiate the integrand function), we find that

$$\int_{-\infty}^{+\infty} \exp(-\beta x^2) \, x^2 \, dx = \frac{1}{2} \sqrt{\frac{\pi}{\beta^3}} \qquad (A.4)$$

Repeated differentiation with respect to $\beta$ yields

$$\int_{-\infty}^{+\infty} \exp(-\beta x^2) \, x^4 \, dx = \frac{3}{4} \sqrt{\frac{\pi}{\beta^5}} \qquad (A.5)$$

The integrand functions in the integrals (A.3), (A.4), and (A.5)
are even ones. Therefore, the contributions to these integrals of the
intervals $[-\infty, 0]$ and $[0, +\infty]$ are the same. Hence it follows
that, for example,

$$\int_0^{\infty} \exp(-\beta x^2) \, x^2 \, dx = \frac{1}{4} \sqrt{\frac{\pi}{\beta^3}} \qquad (A.6)$$
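
Formulas (A.3) to (A.5) are easy to check numerically. The sketch below approximates the integrals by the trapezoidal rule on a finite interval wide enough for the Gaussian factor to be negligible at its ends; the function name, the interval half-width and the number of steps are arbitrary choices made for this illustration.

```python
import math


def gauss_moment(beta, power, half_width=12.0, steps=20000):
    """Trapezoidal estimate of the integral of exp(-beta x^2) x^power over the whole axis.

    The infinite limits are replaced by +-half_width / sqrt(beta), where the
    integrand is already vanishingly small.
    """
    limit = half_width / math.sqrt(beta)
    h = 2.0 * limit / steps
    total = 0.0
    for k in range(steps + 1):
        x = -limit + k * h
        weight = 0.5 if k in (0, steps) else 1.0   # end points get half weight
        total += weight * math.exp(-beta * x * x) * x ** power
    return total * h


if __name__ == "__main__":
    beta = 2.0
    print(gauss_moment(beta, 0), math.sqrt(math.pi / beta))               # (A.3)
    print(gauss_moment(beta, 2), 0.5 * math.sqrt(math.pi / beta ** 3))    # (A.4)
    print(gauss_moment(beta, 4), 0.75 * math.sqrt(math.pi / beta ** 5))   # (A.5)
```

Each printed pair should agree to many significant figures; half of the second value corresponds to the integral from 0 to $\infty$ in (A.6).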


A.3. The Stirling Formula 

With great values of $N$, we can obtain a simple approximate formula
for $N!$. In accordance with the definition of $N!$, we have

$$\ln N! = \ln (1 \times 2 \times \ldots \times N) = \ln 1 + \ln 2 + \ldots + \ln N = \sum_{m=1}^{N} \ln m$$

The sum we have written equals the sum of the areas of the columns
depicted in Fig. A.1. At great values of $N$, the sum of the areas of
these columns differs very slightly from the area confined by the
dash curve, which is a graph of the function $\ln x$. Hence,

$$\ln N! \approx \int_1^N \ln x \, dx = [x \ln x - x]_1^N = N \ln N - N + 1$$

Fig. A.1

With great values of $N$, we may disregard unity, and we arrive at
the formula

$$\ln N! \approx N \ln N - N \qquad (A.7)$$

which is called the Stirling formula.

We must note that, strictly speaking, the Stirling formula has
another addend equal to $\frac{1}{2} \ln (2\pi N)$. With great values of $N$,
however, this addend may be disregarded in comparison with the
other two addends.
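
How quickly the approximation improves with $N$ can be seen from a short numerical check. The sketch uses `math.lgamma(N + 1)`, which returns $\ln N!$, as the reference value; the function names are chosen only for this illustration.

```python
import math


def ln_factorial(n):
    """Reference value of ln N! obtained from the log-gamma function."""
    return math.lgamma(n + 1)


def stirling(n):
    """Stirling formula (A.7): ln N! ~ N ln N - N."""
    return n * math.log(n) - n


def stirling_with_addend(n):
    """Formula (A.7) plus the addend (1/2) ln(2 pi N)."""
    return stirling(n) + 0.5 * math.log(2.0 * math.pi * n)


if __name__ == "__main__":
    for n in (10, 100, 1000, 10 ** 6):
        exact = ln_factorial(n)
        rel_err = (exact - stirling(n)) / exact
        abs_err = exact - stirling_with_addend(n)
        print(f"N = {n:>7}  relative error of (A.7) = {rel_err:.2e}  "
              f"error with addend = {abs_err:.2e}")
```

The relative error of (A.7) falls steadily as $N$ grows, which is why the extra addend can be dropped for the very large $N$ met in statistical physics.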


NAME INDEX 


Avogadro, A., 279 


Bernoulli, D., 253 
Bohr, N., 14 
Boltzmann, L., 327 
Braginsky, V. B., 181 


Carnot, S., 350
Cavendish, H., 174 
Chaplygin, S. A., 265 
Clausius, R., 347, 348, 349 


De Broglie, L., 14 
Debye, P. W., 375 
Dicke, R. H., 181 


Einstein, A., 13, 51, 122, 181, 222, 
375 

Eötvös, R., 179, 180, 181
Euler, L., 249 


Fedorov, Y. S., 368 
Feynman, R. P., 55, 56 
Frenkel, Ya. I., 376, 377 


Gagarin, Yu. A., 15 
Galileo Galilei, 63 


Heisenberg, W., 14 


Kelvin, Lord (W. Thomson), 12, 349 
Khaikin, S. E., 55 


Lagrange, J. L., 249 
Lammert, J., 325 


Laplace, P. S., 382 
Lebedev, P. N., 247
Leighton, R. B., 56 
Lenin, V. I., 11, 13, 265 
Lomonosov, M. V., 267 


Maxwell, J. C., 316, 326 
Mendeleev, D. I., 393 
Michelson, A. A., 222 
Morley, E. W., 222 


Narozhny, N. B., 338 
Newton, I., 12, 51, 172, 222 


Panov, V. I., 181 
Perrin, J., 329 
Planck, M., 13 
Pound, R. V., 248 


Rebka, G. A., Jr., 248 
Reynolds, O., 258 


Sands, M., 56 
Schrödinger, E., 14
Sena, L. A., 58
Stern, O., 324, 325
Stokes, G. G., 264 


Thomson, W. (Lord Kelvin), 12, 349 
Torricelli, E., 254 
Tsiolkovsky, K. E., 255


Van der Waals, J. D., 291, 394 
Vavilov, S. I., 11


SUBJECT INDEX


Acceleration, 43 
angular, 49 
Coriolis, 129 
free fall, 73 

determination, 202 
variation with latitude, 124 
in non-inertial reference frame, 120 
normal, 44, 49 
tangential, 43 
transferable, 129 
units, 58 
Adiabat, 285 

and isotherm, 285f 
Amplitude, 

oscillations, 196 
in resonance, 219 
Angle, contact, 383f 
Angstrom (A), 269 
Angular momentum, 75, 108 
and angular velocity, 146ff
body, 138f 

in central force field, 114 
particle, 108f, 114 
system, 108, 112f 
Anisotropy, 366 

and phase transition, 404 
Arm, 

couple, 111
force, 110 
lever, 110 
moment, 110 
momentum, 108f 
Auto-oscillations, 185, 214f 
Axis(es), 
free, 140 

principal, of inertia, 140 
rotation, 18, 133 
instantaneous, 134f 
symmetry, 368 


Barrier, potential, 101 
Beats, 204ff

Body(ies), see also Substance 
amorphous, heating curve, 397f 
angular momentum, 138f 
with axial symmetry, 140 
with central symmetry, 140f 



centre of mass, 136 
equation of motion, 55 
equilibrium, 160f 
free axis, 140 
homogeneous, 137 
inertia tensor, 148, 150 
internal energy, 271 
isotropic, 366 
moment of inertia, 138 
motion, see also Motion 
equation, 55 
in ideal fluid, 262 
in viscous fluid, 263 
non-homogeneous, angular momen- 
tum, 138f 

perfectly rigid, 18, 136 
degrees of freedom, 308 
in plane motion, kinetic energy, 
157f 

principal axes of inertia, 140 
principal moments of inertia, 140f 
rotating, kinetic energy, 153ff
rotation, 133, see also Rotation 
rotational inertia, 138 
thermometric, 280 
translation, 133 
Bond, 

covalent, 370 
heteropolar, 370 
homopolar, 370 
ionic, 370 


Calorie (cal), 273 
Capillarity, 385f 
Cell, crystal, 

elementary, 366f, 369 
unit, 366 
Centre(s), 

condensation, 396 
crystallization, 398 
of gravity, 159f 
of inertia, 103 
of mass, 103 
motion, 136 

Chamber, Wilson cloud, 396 
Clocks, synchronization, 223f 



Coefficient, 

compliance, 66 
diffusion, 407, 413, 414f 
friction, 70 

ultrararefied gases, 422 
internal friction, 256, see also Vis- 
cosity 

performance, 346f 
resistance, 188 
self-diffusion, 415 
thermal conductivity, 408, 417ff 
ultrararefied gases, 423 
viscosity, see Viscosity 
Collision, 

central, 105f 

completely inelastic, 104f 
perfectly elastic, 104, 105f 
central, 105f 
Complex number(s), 190ff 
absolute value, 190 
addition, 191 
argument, 190 
complex conjugate, 190 
division, 191 
exponential form, 191 
imaginary part, 190 
modulus, 190 
multiplication, 191 
real part, 190 
trigonometric form, 190 
Concentration, 
absolute, 407 
relative, 406 
Condensation, 389 
Conductivity, thermal, 408, 417ff 
Constant, 

Avogadro, 268 

determination, 329f 
Boltzmann, 280 
gravitational, 172, 173f 
molar gas, 279 
Planck’s, 36 
spring, 65 
Contraction, 

Fitzgerald, 231f
Lorentz, 231f
Couple,
arm, 111
force, 110f
Crystal(s), 
atomic, 370f 
classification, 368ff 
defects, 372ff 
linear, 372 
point, 372 

elementary cell, 366f, 369 
parameters, 367 



heat capacity, 374f 
ionic, 370 
lattice, 368 
kinds, 370ff 

symmetry elements, 368f 
liquid, 376 
metallic, 371 
molecular, 371f, 374 
primitive cell, 367 
translation period, 366 
Crystallization, 398f 
Curvature, 44 
centre, 44 
radius, 44 
Curve (s), 

evaporation, 401f 
fusion, 398 

heating, amorphous body, 397f 
melting, 398, 401f, 404 
resonance, 220 
sublimation, 401f 
Cycle, 271, 345 
Carnot, 350f 
efficiency, 351ff 
ideal gas, 353f 
for two-phase system, 399f 
refrigerating machine, 346 


Decrement, 78 
damping, 212 
logarithmic, 212 
Deformations, elastic, 64f 
Degree, 

absolute, 356 
Celsius, 278 
of freedom, 100 
body, 308 

molecules, 310, 312 
rotational, 308, 310ff 
translational, 308, 310ff 
vibrational, 309ff 
Density, 142f 

energy, elastic deformation, 99 
at given point, 143 
ideal gas, 280 
Derivative, partial, 87 
Diagram, 
phase, 402ff 
p-T, 388, 390 
p-V, 276, 351, 353, 391f 
T-S, 350f
vector, 203f 

Differential, total, 88, 274, 359 
Diffusion, 406f 
in gases, 413ff
Dislocations, 372f
edge, 372 
screw, 372, 374 
Displacement, 36 
body, 133 
Distance, 36 
Distribution, 

Boltzmann’s, 327f 
Maxwell’s, 316ff, 324ff 
Drag, 262, 263 
friction, 263f 
pressure, 263f 
Dynamics, 19, 51 
Dyne (dyn), 58 


Effect, gyroscopic, 165f 
Efficiency, 

Carnot cycle, 351ff 
heat engine, 346 
Elongation, relative, 66 
Energy, 75 
centrifugal, 131f 
density, 99 
dissipation, 91 
elastic deformation, 98f 
free, 363, 365 
liquid, 380 
Gibbs, 364 

harmonic oscillation, 198f 
Helmholtz, 363 
internal, 271, 362, 365 

ideal gas, 281f, 284, 311, 361 
van der Waals gas, 292f, 361 
and volume, 360 
kinetic, 77f 

body in plane motion, 157f 
harmonic oscillation, 198f 
rotating body, 153ff 
mechanical, total, 87 
of system, 97f 

molecules, 307, 311, 314, 327 
particle, in central force field, 
115ff 

photon, 247f 
potential, 85, 87, 89f 
and altitude, 327 
in central force field, 115 
harmonic oscillation, 198f 
interaction, 93ff 
separate body, 98f 
spherical layer and particle, 170ff 
zero, 89 

relativistic expression, 240ff 
rest, 242 
rotational, 314 
total, 87, 97f, 116f, 242 



transformations, 243ff 
vibrational, 314 
Engine, heat, 346 
Ensemble, statistical, 295 
Enthalpy, 364f 
Entropy, 

change in melting, 359 
ideal gas, 357f 
isolated system, 337, 343 
monatomic ideal gas, 338ff 
non-isolated system, 337 
system, 337 
water, 359 
Equation(s), 
adiabat, 285 
Bernoulli’s, 252f 

for horizontal streamline, 253 
characteristic, 194 
Clapeyron-Clausius, 400 
elementary process, 273 
invariance, 222 
isotherm, 284 

linear differential, 189, 192ff 
motion, 
body, 55 
particle, 76 
Poisson, 285 
polytrope, 286f 
of state, 278 

ideal gas, 279, 280 
van der Waals, 291f, 395 
Equilibrium, 

liquid-solid, 398 
liquid-vapour, 389ff 
rigid body, 160f 
stable, 101 
unstable, 101 
Erg, 81 

Evaporation, 389 
Events, simultaneity, 230 
Experiment, 

Cavendish’s, 174 
Eötvös’s, 179f
Lammert’s, 325f 
Perrin’s, 329f 
Stern’s, 324f 

Exponent, polytropic, 287 


Factor, 

damping, 211 
refrigerating, 346f
Field, 
force, 82 

central, 82, 113 
homogeneous, 82
non-stationary, 82 
potential, 85 
stationary, 82 
gravitational, 174ff 

homogeneous sphere, 178 
velocity vector, 249 
Figures, Lissajous, 209f 
Flow, see also Flux 
laminar, 258, 260f 
steady, 249f, 259 
turbulent, 258, 260f 
Fluctuation, 
mean square, 334, 335 
relative, 334f 

Fluids, see also Gas(es), Liquid(s) 
boundary layer, 263 
flow velocity, 260f 
incompressible, 251 
Flux, 405 
negative, 405f 
positive, 405f 
Force (s), 55f 
arm, 110 

characterization, 74 
conservative, 82f, 89 
Coriolis, 125ff 
couple, 110f
arm, 111
dissipation, 91 
elastic, 64ff 
electromagnetic, 64 
external, 75 
field, 82, 85, 113 
friction, 64, 68ff, 85 
internal, 255ff 
fundamental, 64 
gravitational, 64 
gravity, 71, 83f, 124, 159 
and weight, 73 
gyroscopic, 166 
inertia, 120ff 

centrifugal, 122ff, 131, 180 
interaction, conservative, 93 
internal, 75 
lifting, 262 

moment, 109, 110, 158f 
non-conservative, 85, 98 
normal, 68f 
quasi-elastic, 187 
resistance, of medium, 70 
restoring, 187 
stratification, 68f 
surface tension, 379 
units, 58 

van der Waals, 372 


Formula, 

barometric, 294, 326 
dimension, 59f 
Euler, 191 
Laplace, 382 
Poiseuille, 261 
Stirling’s, 339, 427f 
Stokes, 264 
Torricelli, 254

Frame, reference, see Reference frame
Frequency, 
circular, 197 
cyclic, 197 

measurement result, relative, 295 
natural, 210 
oscillations, 197, 201 
resonance, 219, 221 
Friction, 
dry, 68ff 
external, 68 
internal, 68, 70, 408 
liquid, 68 
rolling, 70 
sliding, 69f 
viscous, 68, 70 
Function(s), 

derivative of product, 33ff 
potential, 85f 

probability distribution, 297 


Gas(es), see also Fluids 
ideal, see Ideal gas(es) 
perfect, see Ideal gas(es) 
pressure on wall, 304ff 
real, 279, 290ff 
standard conditions, 279 
thermal conductivity, 416ff 
ultrararefied, 421ff 

friction coefficient, 422 
thermal conductivity coefficient, 
423 

van der Waals, 292 

internal energy, 292f, 361 
viscosity, 420f 
Gradient, 88, 406 
Graph, bar, 296f 
Gyrocompass, 168 
Gyroscopes, 165ff, see also Top 


Heat, 

amount, 272f 

capacity, see Heat capacity 
mechanical equivalent, 273 
received in reversible process, 351 
transfer, 272
unit, 273 

vaporization, specific, 389 
Heat capacity, 281f 

at constant pressure, 283, 312, 364 

at constant volume, 282, 312, 41$ 

fusion, 397 

ideal gas, 288 

molar, 281 

specific, 281 

and temperature, 313, 375 
transition, 388 
Hertz (Hz), 197 
Histogram, 296f 
Hydrodynamics, 249 
Hypothesis, 

de Broglie’s, 14 
definition, 12 
ergodic, 332 

Ideal gas(es), 279 

adiabat equation, 285 
density, 280 
entropy, 338ff, 357f
equation of state, 279, 280 
heat capacity in polytropic proc- 
ess, 288 

internal energy, 281f, 284, 311, 361 
isotherm equation, 284 
work, 288ff
Inertia, 53 
Integral(s), 
motion, 75 
Poisson, 426 

Intensity, gravitational, 174f 
Interactions, kinds, 64 
Interval, 225, 234ff
imaginary, 235, 236 
real, 235 
spacelike, 236 
timelike, 235 
Isotherm (s), 284 
van der Waals, 394f 


Joule (J), 81, 273 


Kelvin (K), 278 
Kilogramme (kg), 58 

international prototype, 58 
Kinematics, 19, 51 
Kinetics, physical, 405 

Law(s), 

angular momentum conservation, 
75f, 112 



Avogadro’s, 279 
Boltzmann distribution, 327f 
conservation, in non-inertial ref- 
erence frames, 131f 
Coulomb’s, 64 
dry friction, 69f 
Dulong and Petit, 375 
energy conservation, 75f, 91, 98, 272 
entropy increase, 337 
Fick’s, 407 
Fourier, 408 
Hooke’s, 65 
inertia, 52 

Maxwell-Boltzmann distribution, 
328 

Maxwell distribution, 316ff

experimental verification, 324ff
mechanical energy conservation, 98 
momentum conservation, 54, 75f, 
103, 238f 
Newton’s, 

applications, 73f 
first, 52 

second, 55f, 188f 
for non-inertial reference frame, 
121 

relativistic expression, 240 
third, 60f 

relativistic mass-energy relation, 246 
thermodynamics, 

first, 272, 274, 277, 283, 345 
second, 337, 347, 349 
third, 344 

universal gravitation, 172ff
velocity addition, 237 
Length, 

in different reference frames, 230f
proper, 231 
Lift, 262, 264f 
Limit, elastic, 65 
Line, world, 224 
Liquid(s), 376ff, see also Fluids 
density and temperature, 392 
flow, 

from hole, 253ff
in round pipe, 259ff
free energy, 380
ideal, 251, 255
motion, description, 249ff
non-viscous, 251 
quasicrystalline structure, 376 
rate of flow, 261 
shape, 378 
supercooled, 398f 

superheated, 395, 396f
tensioned, 395, 397


Machine(s),

perpetual motion, 
first kind, 345 
second kind, 349 
refrigerating, 346 
reversible, 355 
Macrostate, 330 

statistical weight, 331f, 338
thermodynamic probability, 331 
Mass, 53 
atomic, 267 
relative, 267 
Earth’s, 182
gravitational, 178ff
inertial, 178ff
molar, 268 
molecular, 267 
relative, 267 
reduced, 118 
relativistic, 54, 240 
rest, 54, 240 
particle, 246 
system, 246 
units, 58f 

Materialism, dialectical, 11 
Matter, kinds, 11 
Mechanics, 
classical, 51f 
Newtonian, 14, 51 
quantum, 13, 14, 51f 
relativistic, 51 
Meniscuses, 385 
Metre (m), 57 
Microstate, 330, 336 
Modulus, 

elasticity, 67 
shear, 68 
Young’s, 67 
Mole (mol), 268 
Molecule(s), 

collision(s), 409ff
diameter, 410
number, 411ff
with wall, 301ff
distribution,

by altitude, 327
by velocities, 315ff
effective diameter, 410
effective section, 410
energy levels, 314 
mean energy, 307, 311 
mean free path, 410, 412 
mean velocity, 320ff
number of degrees of freedom, 310, 
312 

and temperature, 313f 
potential energy and altitude, 327 



rotational energy, 314 
size, 268f 
velocities, 298f 

experimental determination, 324ff
vibrational energy, 314 
Moment, 
arm, 110 

force, 109, 110, 158f 
gravity, 159 
of inertia, 138, 142ff
axial, 149, 150
centrifugal, 149ff
disk, 146 

principal, 140f, 152 
sphere, 146 
thin long rod, 146 
Momentum, 54, 75 

angular, see Angular momentum 
arm, 108f 

moment of, 108, see also Angular 
momentum 

relativistic expression, 238f 
system, 102ff
transformations, 243ff
Monocrystals, 366 
Motion, 

Brownian, 266, 329 
in central force field, 113ff
circular, 18f, 46ff, see also Rotation

uniform, 48, 208 
finite, 101 
infinite, 101 

and influence of Coriolis force, 129f
interacting particles, 117ff
liquids, description, 249ff
mechanical, 173 

in non-inertial reference frame, 131 

non-planar, 135 

plane, 133f 

reactive, 255 

rigid body, 158 

translational, 18f 

uniform, 37 


Newton (N), 58 
Non-wetting, 
complete, 384 
partial, 384 
Number(s), 

complex, see Complex number(s) 
degrees of freedom, 100 
Reynolds, 258f 

use as similarity number, 264 
Nutation, 170


Operator, nabla, 88
Order, 

long-range, 376 
short-range, 376 
Oscillation(s), 
amplitude, 196 
damped, 189, 210ff
period, 211
definition, 185

forced, 185, 189, 215ff 
steady-state, 218 
free, 185, 189 
frequency, 197, 201 
harmonic, 186, 195ff 
addition, 203 

mutually perpendicular, addition, 
206ff 

natural, 185, 189 
parametric, 186 
period, 196f, 201f, 211 
phase, 196 
initial, 196 

small-amplitude, 186ff, 200 

Parameters of state, 269 
Particle(s), 

acceleration, 43 
angular momentum, 108f, 114 
in central force field, 113ff 
displacement, 36 
energy, 115ff
equation of motion, 76 
interacting, motion, 117ff 
point, definition, 18 
potential energy, 115 
reduced mass, 118 
rest mass, 246 
trajectory, 116f 
velocity vector, 114 
with zero rest mass, 247f 
Pascal (Pa), 67, 257 
Pendulum, 199ff 
Foucault, 130 
mathematical, 199 
physical, 201f 
centre of oscillation, 202 
reduced length, 202 
simple, 199, 201f 
Period, 

oscillations, 196f, 201f, 211 
and damping factor, 211 
revolution, 48 
Phase, 388 
equilibrium, 388 
oscillations, 196 
transition, 388 
and volume, 400f 


Phenomena, transport, 405ff 
Photon, 247 
energy, 247f 
Physics, 
classical, 12f 
molecular, 266 
statistical, 266 
Plane, 

cleavage, 366 
symmetry, 368 
Point, 

critical, 393 
melting, 398 
and pressure, 400f 
triple, 388, 401ff 
world, 224 
Poise (P), 257 
Polycrystals, 366 
Potential, 85 
gravitational field, 175 
thermodynamic, Gibbs, 364f 
Power, 81 
units, 81 
Precession, 169f 
pseudoregular, 170 
regular, 170 
Pressure, 

capillary, 382, 386 
light, 247 

under liquid surface, 382 
saturated vapour, 390 
Principle, 

constancy of light speed, 222 
Einstein’s relativity, 222 
equipartition, 310f 
equivalence, 181 
Galileo’s relativity, 63 
Heisenberg’s uncertainty, 30f 
Probability, 295 
thermodynamic, 331 
Process, 270 
adiabatic, 284ff 
cyclic, 271 
elementary, 273 
equilibrium, 270 
irreversible, 336, 337, 344, 405 
entropy change in, 348 
isentropic, 350 
isobaric, 284, 287, 364 
isochoric, 284, 287f, 290 
isothermal, 284, 288, 363 
polytropic, 286f 
quasistatic, 270 
relaxation, 269 
reversible, 270, 342f, 347f 
heat received, 351 
Pump, water-jet, 253 
Quality (Q), oscillatory system, 
Quantity(ies), 
critical, 393 
fluctuations, 334 
invariant, 225 
physical, 
basic, 56 
dimension, 59f 


Radius, molecular action, 291f 
Reaction, 71 
discharged flow, 255 
Reference frame, 17 
centre-of-mass, 104 
heliocentric, 52 
inertial, 52, 63 
laboratory, 104 
non-inertial, 52, 120ff 
Relaxation, 270 
Reservoir, 

high temperature, 349 
low temperature, 350 
Resistance, head 
Resonance, 219ff 
parametric, 221 
Rotation, 18f 
axis, 133 

instantaneous, 134f 
body, 133 
stable, 141 
and translation, formulas compared, 
155 

unstable, 141 
Rule, see also Law(s) 
right-hand screw, 46f 
velocity addition, 63 


Scale, temperature, see Temperature 
scale 

Second (s), 58 
Self-diffusion, 415 
Shear, 67 
modulus, 68 
relative, 66 
Shift, red, 248 
Sink, heat, 350 
Solid, amorphous, 377, 399 
Source, heat, 349 
Space, 

four-dimensional, 224, 244f 
isotropy, 76 
pseudo-Euclidean, 225 
three-dimensional, 224f 
uniformity, 75f 
velocity, 315 


Speed, 37, see also Velocity(ies) 
units, 58 

Sphere, molecular action, 377 
State(s), 
critical, 393 

equilibrium, 269f, 336, 364 
metastable, 395ff 
non-equilibrium, 269 
Strains, elastic, 64f, 66 
Streamlines, 249f 
Stress, 66 
normal, 66 
shear, 66f 
tangential, 66 

Structures, quasicrystalline, 376 
Sublimation, 389 
Substance, see also Body(ies) 
amorphous, 377 
critical state, 393 
thermometric, 280 
Surface, mean curvature, 381f 
Symbol, Kronecker, 27 
System, 269 

angular momentum, 108 
closed, 75 

angular momentum, 112 
momentum, 102ff 
coordinate, 19 
crystallographic, 369 
cubic, 369, 370 
hexagonal, 369 
monoclinic, 369 
rhombic, 369 
rhombohedral, 369 
tetragonal, 369 
triclinic, 369 
trigonal, 369 
entropy, 337, 343 
internal energy, 271 
isolated, 53 
macroscopic, 295 
mechanical, 17 

degrees of freedom, 308f 
equilibrium, 100f 
momentum, 102ff 
rest mass, 246 

total mechanical energy, 97f 
unclosed, 

angular momentum, 112f 
momentum, 103 


Temperature, 277 
absolute, 278 
and molecule energy, 307 
critical, 392f 

scale, see Temperature scale 




Temperature scale, 
absolute, 278 
Celsius, 277 
empirical, 281 
ideal gas, 281 
international practical, 278 
thermodynamic, 278, 356 
Tension, surface, 379f 
Tensor, 

components, 148ff 
inertia, body, 148, 150 
rank two, 148 
symmetrical, 150 
Theorem, 

Carnot, 352 

on flow continuity, 250f 
Maclaurin, 186 
Nernst’s, 344 
parallel axis, 144f 
probability multiplication, 296 
probability summation, 295 
Steiner, 144 

Theory, molecular-kinetic, 266 
Thermodynamics, 266f 
Thermogen, 267 
Thrust, 255 
Time, 

dilation, 233 

length between events, 232 
proper, 232f 
relaxation, 270 
uniformity, 75 
Tops, 165 
asymmetrical, 141 
spherical, 141, 152, 153, 156f 
symmetrical, 141, 152 
free, 171 
Torque, 109 
Trajectory, particle, 36 

in central force field, 116f 
Transformations, 

Galilean, 62 
Lorentz, 62, 229 
corollaries, 230ff 
Transition, phase, 388, 400f 
Translation, 18 
body, 133 

and rotation, formulas compared, 
155 

Tube, flow, 250 
Unit(s), 

atomic mass (amu), 267 
basic, 57, 58 
derived, 58 

prefixes for multiples and submultiples, 81f 



systems, 

absolute, 57 
cgs, 58 

International (SI), 57 
mk(force)s, 58 


Vaporization, 389 
Vapour, 

saturated, 390 

density and temperature, 392 
pressure, 390 
supersaturated, 395f 
Vector(s), 

acceleration, 46 

addition, 21 

axial, 29 

collinear, 21 

components, 25 

coplanar, 21 

cross product, 29 

definition, 20f 

derivative, 32ff 

diagram, 203f 

dot product, 26 

energy-momentum, 245 

expression through projections, 25 

four-, 244f 

free, 21 

gravitational field, 174 
linear relation between, 23 
localized, 21 
magnitude, 21 

multiplication by scalar, 22f 
polar, 29 
position, 25f 
projection, 24 
properties, 74 
pseudo-, 29 
rotation, 47 
scalar product, 26ff 
scalar triple product, 31 
slip, 21 

subtraction, 21f 
true, 29 
unit, 23 

derivative, 32ff 
vector product, 28ff 
vector triple product, 31ff 
velocity, 39ff 
field, 249 

in polar coordinates, 114 
Velocity(ies), 39ff 
addition, 237 
angular, 47 
definition, 37 
escape, 183f 

from solar system, 184 
first cosmic, 182f 
mean square, 321 
most probable, 321 
orbital, 182f 
second cosmic, 183f 
sector, 114 
space, 184 
tangential, 182f 
third cosmic, 184 
transformation, 237 
units, 58 

Vibrations, see Oscillation(s) 
Viscosity, 255f, 408 
determination, 261 
dynamic, 256, 259 
gases, 420f 
kinematic, 256, 259 
and temperature, 258 
units, 257f, 408 
Volume, specific, 391 


Watt (W), 81 
Weight, 72 
Well, potential, 101 
Wetting, 
complete, 384 
partial, 384 
Work, 77, 78ff, 274f 
centrifugal force of inertia, 
conservative forces, 90 
in cycle, 345f, 400 
in cyclic process, 276 
ideal gas, 288ff 

adiabatic process, 289 
isobaric process, 290 
isothermal process, 289f 
isochoric process, 290 
isothermal process, 363 
non-conservative forces, 90 
units, 81 

and volume changes, 275f 


Zero, absolute, 278 